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(57) Abstract 

Conjugates of heparin -binding cpidennal growth factor-lilcc growth factor (HBEGF) linked, either directly or via a linker, to a targeted 
agent are provided. The targeted agent is a cytotoxic agent, such as a ribosome-inactivating protein (RIP) and an antisense nucleic acid, or 
is a therapeutic nucleic acid for targeted delivery. The targeted agent is attached to HBEGF, or via a linker, through a chemical bond, or the 
conjugate is prepared as a chimera using techniques of recombinant DNA. The conjugates are used to target cytotoxic agents or therapeutic 
nucleotides to cells bearing EGF receptors and are particularly useful for treating solid tumors, such as breast and bladder tumors, and for 
treating disorders involving pathophysiological proliferation of smooth muscle cells, such as restenosis. 
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Descriptinn 

CONJUGATES OF HEPARIN-BINDING EPIDERMAL GROWTH FACTOR-LIKE 
GROWTH FACTOR WITH TARGETED AGENTS 

Technical FipIH 

This invention is related to the preparation and use of heparin-binding 
epidermal growth factor-like growth factor (HBEGF) conjugated to a targeted agent, 
such as a cytotoxic protein or a nucleic acid. The conjugates are for use as anti-tumor 
agents, for the treatment of disorders involving pathophysiological proliferation of 
smooth muscle cells, such as restenosis, and to effect genetic therapy of cells that bear 
receptors for heparin-binding epidermal growth factor. 

BackgrmmH nf the Invention 

A major goal of treatment of neoplastic diseases and hyperproliferative 
disorders is to ablate the abnormally growing cells while leaving normal cells 
untouched. Various methods are under development for providing treatment, but none 
provide the requisite degree of specificity. 

One method of treatment is to provide toxins. Immunotoxins and 
20 cytotoxins are protein conjugates of toxin molecules with either antibodies or factors 
which bind to receptors on target cells. Three major problems may. limit the usefulness 
" ' ®^ ^"^^"noto'^'i^s-^ ' F^^^^^ with more than one cell surface 

molecule, thereby effecting IdelWery to niultipie cell types, possibly including normal 
cells. Second, even -if the! aintibody sfxicihc: the"aiitii>^dy r^ctiVe moiecule may be 
25 present on normal cells. Third, the toxin molecule may be toxic to cells prior to 
delivery and internalization. Cytotoxins suffer fromjsimilar disadvantages of specificit>' 
and toxicity ...Another limitation in the therapeiitic use of immunotoxins and: cytotoxins 
is the relatively low ratio of therapeutic to, ;foxic dosage. AdditionalI>\ it.i!may be 
difficult to. direct sufficient concentrations of the toxin into the cytoplasm and 
30 intracellular cbrnpartments in which the agent can exert its desired activity. 

. ;Given these limitations, cytotoxic therapy has been attemptedlusing viral 
vectors to deliver DNA encoding the toxins into cells. If eukaryotic viruses are used, 
such as the retroviruses currently in use, they may recombine with host DNA to produce 
infectious virus. Moreover, because retroviral vectors are often inactivated by the 
35 complement ;system,vuse in vivo is limited. Retroviral vectors also lack specificity in 
delivery; receptors for most viral vectors are present on a large fraction, if not all. cells. 
Thus, infection with such a viral vector will infect normal as well as abnonnal cells. 



BNSOOCPD: <WO_9eoe274A2 I > 



W096A)8274 




PCT/US^12205 



Because of this general infection mechanism, it is not desirable for the viral vector to 
directly encode a cytotoxic niolecule. 

While delivery of nucleic acids offers advantages over delivery of 
cytotoxic proteins such as reduced toxicity prior to intemalization, there is a need for 
5 high specificity of delivery, which is cxirrently unavailable with the present systems. 

In view of the problems associated with- gene, therapy, there is a 
compelling need for improved treatments which are more effective and are not 
associated with such disadvantages. The present invention exploits the use of 
conjugates which have increased specificity and deliver higher amounts of nucleic acids 
10 to targeted cells, while providing other related advantages. 

Summary of the Invention v ^ 

The present invention generally provides conjugates of heparin-binding 
epidermai-like growth factor-like growth factor (HBEGF) polypeptide or a portion 

15 thereof and a targeted agent. In one embodiment of this invention, the HBEGF and 
targeted agent are conjugated through a linker. Within each conjugate, there can be 
more than one HBEGF and targeted agent molecule. Preferably, in the conjugates, 
there are between one and six HBEGF and targeted agents, and most preferably one 
HBEGF mdleciile and one targeted agent. In certain embodiments, the linker is,selected 

20 from the group consisting of protease substrates, linkers that increase the flexibility of 
the conjugate; linkers that increase the solubility of the conjugate, photocleavable 
linkers and acid cleavable linkers. In certain other embodiments, the HBEGF 
polypeptide may be mammalian HBEGF or HBEGF that is modified by addition of a 
cysteine residue or replacement of a nonessential amino acid residue within about 20 

25 amino acids of the N-terminus or C-terminus. In yet other embodiments, the targeted 
agent is cytotoxic, preferably a ribosome inactivating protein, and most . preferably 
saporin. Other cytotoxic agents include a nucleic acid. 

In another embodiment, the conjugate has the formula: . targeted agentj^- 
(L)q-HBEGFjn or HBEGFn^-(L)q targeted agent,^, wherein n and m. which may be the 

30 same or different, are at least 1 . 

In another aspect, methods of treating HBEGF-mediated 
pathophysiological conditions, comprising administering to the animal a therapeutically 
effective amount of a conjugate between HBEGF and a cytotoxic agent, are provided. 
In certain embodiments, the condition is a dermatological disorder involving epidermal 

35 cells, a neoplastic disorder of epidermal or mesodermal cells, an ophthalmic disorder 
involving proliferation of epithelial cells, or a disorder characterized by proliferation of 
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- smooth muscle cells. Methods are also provided to inhibit proliferation of cells bearing 
HBEGF receptors, comprising contacting the ceils with an effective amount of a 
-;, IfflEGF targeted.agem conjugate;. ' ' - i^- ^' ' 

:'■ 1 -. a-..^ In .y«?.other aspects} methods oreffectmg gene^ are, provided, 

5 ;^.M*wmx»llsarec*miacted VWthacoiy is a nucleic 

" ^ aiijirtl»'ccmjiigatt-ittctoj6s^a'-h&tear to the targeted 

r/-. •:. nucleic ■aci^ofi-HBEGF. >■•'-'• '■■ -'.'i' -. :,J .itui.Hqr- 

> f s :*/a v- In-yetrother aspects,^NA fragrii^^ encodnig a conjugate between a 
„. targeted agent and HBEGF^ provided;' In derkh^^ the DN A conjugate 

10 may additionally comprise a ilinker. ^ Plfeidsi vmdfs; alid h^^^^^ provided. 
In another embodiment, methods of producing a fusion protein of HBEGF. and a 
targeted agent are provided comprising (a) cuitmirig cells ir^^^^^ wiUi a plasmid 
containing a DNA fragment according to claim 2.1, mder conditions, .whereby the DNA 
fraigniiem is transcribed and translated; (b) lyising the cells in a .buffer containing urea; 
15 (c) eluting the protein frdm a cation-exchange chromatography resin; (d) passing the 
protein over an iniori-exchange chiromatography resin; (eVeluting . the' protein from a 
V cation-exchange chrdinatography Vesiiii ^^^^^ protein fh>mi a hydrophobic 

intention airofriatogV^phy fesihi'lid ?g)"i^oyering the p«jtein from' a siW exclusion 
-■^-■'icHre^t^^hy^esffi;^"- "^-^^^ ''^ -^"-^-^ ''^'''P'' ^--^----^'^^ -^^-'- '--^ 

20 ;s->! - 'In bth^r embo^iiriem^^^^^^^ 

' ' the 
' mserted residue reijlafcbs a iionessehtial residue in the unmpdi/ied HBEQF. 
' ' " ;Phafeaceiiticar comj^^^^ 

-conjugate and a'physiologicMacceptabie excip are also provided. , . 
■ ' ' other aspects of the 

^' iipoh refetence to the'Ydllovwng delaiied de^^ and attached dravvings. In addition, 
varioiis references are set foi^ below which d«cri% m m^ certain procedures 

or coiTiiwsitions,irtdWe thCT^^ incorporated by reiference.in their entirety. 

30 Detailed Description nf t he Tnvpntinn ■ •• 

(i Delinitibhs ' 

. . / : i b Unless defined Otherwise, all technical and scientific- teriris used herein 
■ have the same meaning as is commonly understood by one of skill in the an to which 
the subject miatter herein belongs. All U.S. patents and all publications mentioned 
35 herein are incorporated in their entirety by reference thereto. 
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The "amino acids" are identified according to their well-knouTi. 
three-letter or one-letter abbreviations. The nucleotides, which occur in the various 
.DNA fragments* are designated with tJie standard single-letter designations used 
routinely used in the art. 
5 As used herein, to "bind to a receptor" refers to the. ability of HBEGF to 

detectably bind to such receptors, as assayed by standard m vitro assays. For example, 
binding, as used herein, measures the capacity of the HBEGF conjugate or HBEGF 
polypeptide to specifically bind to a, HBEGF receptor (known as EGF receptor) on 
smooth muscle or epidermoid cells, such as A431 cells, using a procedure substantially 

10 as described in Moscatelli (1987) J. C^ll Physiol, yi/:123-130. Briefly, cells are grown 
to subconfluence and incubated in appropriate, buffer with detectably labeled, such as 
radioipdinatcd HBEGF polypeptide iri the presence of various concentrations of an 
H3EGF polypeptide of interest. Binding afTmity is measured by counting the 
membrane fraction that is solubilized in a suitable bufferxontaining a detergent, such as 

15 in0.5%TritonX-100inPBS(pH8.1). 

As used herein, the term "biologically active," or reference to the 
"biological activity of a cytotoxic conjugate of HBEGF," such as a conjugate containing 
HBEGF and saporin refers to the ability of such polypeptide to enzymatically inhibit 
protein synthesis by inactivation of ribosomes either in vivo or in vitro or to inhibit the 

20 growth of or kill cells upon internalization of the saporin-containing polypeptide by the 
cells. Such biological or cytotoxic activity may be assayed by any method known to 
those of skill in the art including, but not limited to, the in vitro assays that measure 
protein synthesis and in vivo assays that assess cytotoxicity by measuring the effect of a 
test compound on cell proliferation or on protein ^ synthesis. Particularly preferred. 

25 however, are assays that assess cytotoxicity in targeted cells. 

As used herein, "biological activity" refers to the in vivo activities of a 
compoimd or physiological responses that result upon in vivo administration of a 
compound, composition or other mixture. Biological activity. . thus, encompasses 
therapeutic effects and pharmaceutical activity of such compounds, compositions and 

30 mixtures. Such biological activity, however, may be defined with reference to 
particular in vitro activities, as measured in a defined assay. Thus^ for example, 
reference herein to the biological activity of HBEGF or fragment thereof refers to the 
ability of the HBEGF to bind to cells bearing HBEGF receptors and internalize a linked 
agent. Such activity is typically assessed in vitro by linking the HBEGF (or fragment) 

35 to a cytotoxic agent, such as saporin. contacting cells bearing HBEGF receptors, such as 
A43 1 cells, with the conjugate and assessing cell proliferation or growth. Such in vitro 
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activity should be extrapblatable to rn v/vo activity. /« v/vo activity may be assessed 
using' recognized animal models, such as the mouse xenograft mibde! for anti-tumor 
activity (5ee, e.^., Beitz ei al. (1992) Cancer Research 52:227-230* ftoiighton et al. 
(1982) Cancer Res. ^2.535-539; Bogden et al. (1981) Cancer rPhiidtJelphiar48:\0-20-, 
y--liodgem^ P;871-879; Stastny et al. 

-'y'''(\99h)CatKer'Ry.53:57m^ " 

\ > : 0 t'jf,.'»? - X.k'iisfe'd- herein, a "conjugate"' refers to si iholeciife that contains at least 
' one' HBEGFihw^^ that are' linked dir6ctiy or via a 
' ■ linker and that sire produced by chemical couplitig methods or by recombinant 
10 " expression dif chiniferic DN A mbleib^^^^ ' '\ 

• • " => . : As iiscd 'heiffeiri/th'e ti^ra^^ 

Of ihiiibitmg cfell fvitnction. " fhfe ageiit'- liiay^ inhibit cell ' growth: diffbrentiaiion or 
pfbliferaiioh or liiay be'toxic to' celts! I^is term' includes ageihts' thatl when internalized 
by a celli interfere with or detrimentally alter delliiliir mebbblism' or in any manner 
15 inhibit cell growth or proliferation. The' tern includes' agents whose toxic effects are 
mediated when trainsported into the c^ll and also those whose toxic effects are mediated 
- • at the cell kuffade. '"^A v^ety cifcytbtoxiic agents can be Used and include those that 
' ' inhibit protein synthesis arid those tliat inhibit expre'ssibh of certain genes essential for 
ceHular grovvth or survival. ......... . . ~ ■• 

■ ' 20~ ■■'■^'■)--' ' ' As' us6d h6rein^ "DNA enc^^^^^ p^ptiae'' or polypeptide" 

■ ' ■' ' referV to any DN A fragment encbding an Exemplary DNA 

fra'gWnfs ihdude: W^^^ such DNA fragments knoWn to '^thbse of skill in • the art; any 
■bNA fraghiem 'that enc^'des ii HBEGF tliat binds to 'an HBEGF receptor and is 
internalized thereby and may be' isolated cell libi^ry usiiig any Of the 

25 preceding DNA fragni^ts" as^a' ^jfobfe'; aficl iany DNA fragmeiit that encodes any of the 
' HBEGF polypeptides sfet forth in SEQ ID Nr6s.=2o: ' Stich DNA sequences encoding 
" ' HBEGF fragments 'are Available from piiblicly a(ic;«issible databases;' such as: DNA 
:;. . ij^jjy ,993 release froni 'DNASTARl fnc. Madiisori. Wl, and Genbank Accession Nos. 
• " Ki93012 (mohkey)'and M60278 (human); the plasmid pMTN^HfiiEGF (ATCC #40900) 
' 36' "'"and pAX-HBEGF (ATCC #40899) described in published International Application 
'• ' 'W (see. also: the corresponding U.S. Patent upon its issuance): and 

' Abi=aham et al. (1993) Biochem. Biophys. Res: Comm 790:125-133). DNA encoding 
■ 'HBEGF' polypeptides will, unless' modified by replacement of degenerate codons, 
' hybridize under conditions of at least low stringency to DNA encoding a native HBEGF 

• 35 (e.^r.; SEQ Id NO.-l).' In addition, any DNA fragment that may be produced from any 

of the" preceding DNA fragments by substitution of degenerate codons is also 
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sequence of HBEGF polypeptides, and DNA fragments encoding such peptides, are 
available to those of skill in this art, it is routine to substitute degenerate codons and 
produce any of the possible DNA fragments that encode such HBEGF polypeptides. It 
is also generally possible to synthesize DNA encoding such peptides b^ on the 
amino acid sequence. 

As used herein, a "fusion protein" refers to a poi>'peptide that contains at 
least two components, such as a HBEGF polypeptide and a targeted agent, a HBEGF 
polypeptide and linker, or a HBEGF polypeptide, linker, and targeted agent, and is 
produced by expression of DNA in a host cell. 

As used herein, "heparin-binding q)idermal growth factor-like growth 
factor" (HBEGF) polypeptides refer to any polypeptide tiiat binds to in HBEGF 
receptor, and is transported into the.cell .by virtue of its inteiaetion witfi the^rtsceplor. 
Native HBEGF has a h^arin-binding dom^. <In particular, HBEGF refers to 
polypeptides ha>ang amino acid sequences of a native HBEGP porypeptide, as well as 
HBEGF polypeptides modified by amino acid siibstitutions,* deletions, insertions or 
additions in the native protein, but retains the ability to bind to a HBEGF receptor and 
to be internalized in a cell bearing such receptor. Such HBEGF polypeptides include, 
but are not limited to himian HBEGF /SEQ . ID NO. 2), monkey HBEGF (SEQ ID NO 
4) and rat HBEGF (SEQ ID, NO. 5). Reference to HBEGFs is intended to encompass 
HBEGF polypeptides isolated from naniral sources as well as tiiose made synthetically, 
as by recombinant means or by chemical synthesis. This term also encompasses the 
precursor forms, such as those set forth in SEQ ID NQs. 1, 2, 4, and 5, and manire 
forms, such as that set forth in SEQ ID No, 3. HBEGF also encompasses muteins of 
HBEGF that possess the ability to target a- targeted agent, such as a cytotoxic agent, 
including but not limited to ribosomc- inactivating proteins, such as saporin, light 
activated porphyrin, and antisense nucleic acids, to HBEGF-receptor expressing cells. 
Muteins of HBEGF include, but arc not limited ta, those produced by replacing one or 
more of the cysteines witii serine as described herein or those that have any other amino 
acids deleted or replaced, as long as the resulting protein has the ability to bind to 
HBEGF-receptor bearing cells and internalize the linked targeted agent. Typically, 
muteins will have conservative amino acid changes, such as those set forth below in 
Table 1. DNA encoding such muteins will, unless modified by replacement of 
degenerate codons, hybridize under conditions of at least low stringency (1 X SSPE or 
SSC, 0-.l% SDS, 50'»C, medium stringency; 0.2 X SSPE or SSC, 0.1% SDS. 50*0: 
high stringency: 0.1 X SSPE or SSC; 0.1% SDS, 65°C) to DNA encoding native 
HBEGF (e.g., SEQ ID NO. 1) and encode an HBEGF polypeptide, as defmed herein. 
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. As used herein, "mature HBEGF" refers to processed HBEGFs. Various isoforms of 
, matirre HBEGF have variable N-terinihi, and include, but are not limited to, those 
: .! .^.^vingvN^ejmini con-esponding to 'amino acid pbsh^ 77 and 82 of the 

. pr^yasorpTC>tein(see,.e.gj, SEQ ID Nos: i , 2, 5ee o/jo SEQ 

5 herein a "portion of a HBEGF" refers to a fragment or piece of HBEGF that is sufficient 
y , ^9 bin<i,tq,a,reeepjtor'to which native HfiEGF "binds and internalize a linked targeted 

^ V . -.agent,..:.; L. -.or;^ r . > - " , 

: ^= AS:llsed:>he^ein,^"HBEGF-^^^ 
to a deleterious condition characteriiSed by or 'csiiised by prolifei-ation of cells that are 
10 sensitive ,to (HBEGF -mitdgehic ^ytiiriiilation. HBEGF-media^^^^ pathophjfsiological 
. conditions : include,^ ' but i iare ' not : lifnitecl to^ ' VoAditibris involving . pathophysiological 
proliferation o^.smooth muscle cells,- sudb as restenosis^ certain ti^mprs, such, as solid 
^iimors including, .b^^ bladder turribrs; tiimors involving pathophysipjogical 

expression of EGFv receptors, denhatoloigical disorders,; such as psqriasis, and 
.15 . ophthalmic disorders involving epithelial cells,' such as recuirence of pterygii and 
secondary Jens clouding. ' - ^ • ' ^ ' 

r.;,.; thatTeacts \vith members of the' HBEGF YamH transport 
, , ^^.HBEGI^ into the cell; Fdr e^^ polypeptid^^^^ witl> the^Wgh affi 

20- EGF receptors (EGRrR) on bbviiie^^ditiPMcSth miu^ epidermoid 
-xarcinoma cells (see Higashiyahia et al'. (l99fYSciehce 25):936-939; Higashiyama et al. 
(1992> J: BioL Chem: 267t6265-62U): -T^^ EGF-receptors,'^w 
; Rs, include the EGF Teceptbrs described in (j.S; Patent Nos. 5 , 1 83,884 and 5.2 1 8,090, 

, Ullrich et aL (1984) A^amre:i0^^m«-425, thoise encoded by the eir*5 gene family. 
25 , . • As used hereinr^nuicieit kc^ 

: as targeted agents, .which iricliSflde; -but are not limited tori5N A encoding therapeutic 
proteinsy fragments of DNA- for co-suppression, 'DNA encoding cytotoxic proteins, 
. ; • antisense/ nucleic acids and other such molecules. Reference to nucleic acids includes 
duplex D^IAv single-stranded: DNA, RNA in any form, including triplex, duplex or 
30 . single-stranded .RNA^* anti^sense RNA, polynucleotides, oligonucleotides, single 
, nucleotides and derivatives thereof. 

, : ,v ^ Nucleic acids may be composed of the well-known deoxyribonucleotides 
or ribonucleotides composed of the bases: adenosine, cytosine, guanine, thymidine, and 
■ uridine. As well, various other nucleotide derivatives and non-phosphate backbones or 
35 . phosphate derivative backbones may be used. ' ^ " ' 
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For example, because normal phosphodiester oligonucleotides (referred 
to as PO oligonucleotides or type I; see structure, below, where X = 0) are sensitive to 
DNA- and RNA-specific nucleases, several resistant types of oligonucleotides have 
been developed (see, e.g.^ International Application WQ 93/23570, which is based on 
07/881,255, filed May 11, 1992; International Application • WO 93/15742, which is 
based on 07/833,146, filed February !0, 1992; Wagner et al. (1.993) Science 260:1510- 
1514; U.S. Patent No. 5,218,088, U.S. Patent No. 5,175,269; U.S. Patent No. 
5,1 09,124; Carter et al. (1993) Br. J. Cancer 67:869-876); these include types IWV: 



MocM, ^ B in which B is a nucleotide base; and X is OEt 

. \ / in phosphotriester (type II), X is Me in methylphosphonate 

Q o . (type III; referred to as MP oligos): and X is S in 

'^^,JLcH B phosphorothioate (referred to as PS oligos; U.S. Patent No. 

5^18,088 to Gorenstein et al. describes a method for 
preparation of PS oligos). , Presently, MP and PS 
*p' -| oligonucleotides have been the focus of most investigation. 

* I T^.^^ As used herein, a "therapeutic nucleic acid" 

U«y describes any nucleic acids used in the context of invention 
that modify gene transcription or translation. This term also 
20 includes nucleic acids that bind to sites on proteins and to receptors. It includes, but is 
not limited to the following types of nucleic acids: nucleic acids encoding a protein, 
antisense RNA, DNA intended to form triplex molecules, extracellular protein binding 
oligo nucleotides and small nucleotide molecules. A thersqpeutic nucleic acid may serve 
as a replacement for a defective gene or encode a thersq>eutic product, such as TNF or a 
25 cytotoxic molecule, such as saporin. The therapeutic nucleic acid may encode all or a 
portion of a gene, and may function by recombining with DNA already present in a cell, 
thereby replacing a defective portion of a gene. It may also encode a portion of a 
protein and exert its effect by virtue of co*suppression of a gene product. 

As used herein, "restenosis" refers to a process and the resulting 
30 condition that occurs following angioplasty in which the arteries become reclogged. 
After treatment of arteries by balloon catheter or other such device, denudation of the 
interior wall of the vessel occurs, including removal of the endothelial cells that 
constitute the lining of the blood yessels. Smooth muscle cells (SMCs), which form the 
blood vessel structure, proliferate and fill the interior of the blood vessel. This process 
35 and the resulting condition is restenosis. 
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» / . ' t .1.': U5ed herein, "substantially homologous" with reference to an HBEGF 

• - polypeptide means that the protein is more homologous (i.e., shares niore amino acid 
- ^ - - residues in common) to ahy of the mature 4^ polypeptides included in SEQ ID 

- Nos: r-6 th^ WitIV reference td BlNiA it Wans th^V the DNA^ a 

'5- * "substantially ho'mblo^^^ but W substitiition degenerate cocions, 

• '^^ 'Hybridizes uni^er cbnilitidhis of at least low stringency td DNA^ encoding any of the 

niatme HBEGFs included in tfie sequences i6l forth in SEQ ID l^Ibs. 1 -6. 

:: . \ -used herein, ' isolated, "substantially pure DNA" defers to DN A 
fragments purified according to standard techniques employed by those skilled in the art 
" 10 'Xsee, e}g,; Maniatis et al. (1982)' A>fo/£rci/tor Cloning: A Laboratory Manual^ Cold Spring 
:Harb6r Labbratory -'Ptess, Cold Spnhg Hiirbor, NY and Sambrook et al. (1989) 
' Molecular Clbning: Laboratory Manual^ Cold Spring Harbor Laboratory Press, Cold 

'•'Spring Harbor, NY.)i^ ■•' ' ' '" ' ■ ' - - - . y - 

' ' --As used 'herein, a'' "targeted ' agent" is any agent that is intended for 

• 15 intemali^tion by linkage to a targeting moiety, as defined herein, and that upon 

* intemalization 'in some raanner alteris' or affects cellular metabolism, growth, activity, 
J . . ^viability or otheir property or characteristic of the cell. The targeted agents include 
p^^ organic 'molecules, dru nucleic acids and other such 

o i 'i - m^ ia^gent means to direct it to a cell that 

• 20 ^expresses a selected receptor by liiilarig i^^^ with a 

■ ' ■ HBECjF receptor. Upbri binding to' the receptor the targeted agent or targeted agent 

• linked to the HBEGF is iritehialized by the cell: ' ' ' ' ' 

A. ' Heparin binding epidermargro 

25 ' 1. Polypeptides reactive w 

For the purposes of this invention, HBEGF need only bind a specific 
HBEGF receptor and be internalized. Any member of the HBEGF family, whether or 
not it biiiiis heparin, is useiFtir witKin the context of this invention as long as it meets the 
requirements set forth aboVe. Members of the HBEGF family ^re those that have 

30 sufficient nucleotide identity to hybridize under normal stringency conditions (typically 
greater A 75% nucleotide identity). Subfragme,nts or subpoitions of a full-length 
HBEGF may also be desirable. One skilled in the art may find from the teachings 
provided within that certain biological activities are, more or less desirable, depending 
upon the application. 

35 Heparin-binding EGF-like growth factors (HBEGFs) are mitogens in the 

epidermal growth factor (EGF) protein family that bind to the glycosaminoglycan, 
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heparin. HBEGFs elute from heparin-Sepharose columns at about 1 .0 - 1 .2 M NaCl and 
were first identified as a secreted product of cultured human monocytes, macrophages, 
and the macrophage-like U-937 cell line {see. e.g... Higashiyama et al: ( 1 99 1 ) Science 
257.936-939; Besner et al. (1990) Cc// jee^/. 7:811-819). HBEGFs, Slso called 
5 "hqjarin-binding EGF-homologous mitogen ■ (HB-EHM)" (see WO 92/06705), are a 
family of growth factors that have a broad spectrum of activities, including a mitogenic 
effect on a variety of cells, such as BALB/c 3T3 fibroblasi cells ahd smooth muscle 
cells. HBEGFs, however,, are not mitogenic for endothelial cells (Higashiyama et al. 

(1991) 5c/e«ce 2i7.-936-939). 

isolated, mature HBEGFs are heterogeneous in structure and contain 
up to 86 amino acids, including two site;s of O-linked glycosylation (Higashiyama et al. 

(1992) J. Biol. Chem. 267:6205r6212). The carboxyl-terminal- half of the secreted 
hiiman HBEGF shares approximately 35% sequence identity With hirinan EGF, and 
includes six cysteines spaced in the pattern characteristic of mcmbCTs of the EGF 

15 protein family. HBEGF interacts with the same.high affinity receptors as EGF on 
bovine aortic smooth muscle cells and human A43 1 epidemioid carcinoma cells {see. 
e.g., Higashiyama (1991) Science 257 936-939). The amino-ten-Ainal portion of the 
mature factor, which includes stretches of hydrophilic residues, has no strucmral 
equivaleiit in EGF. The heparin-binding residues of HBEGF reside primarily in a 

20 twenty one-ainino acid stretch upstream, of and slightly overlapping the' EGF-like 
domiain. HBEGF appears to be a more potent mitogen for smooth muscle cells than 
either EGF or TGF-a, which also binds to, EGF receptors. 

Mammalian HBEGFs are derived from a 208 amino acid precursor 
protein. The human and monkey precursor proteins share 97% sequence identity, the 

25 rat and mouse precursors are 92% identical; and there is 80% sequence identity between 
primate and rodent HBEGF preciflwr proteins (see Abraham et al. ( 1 993) Biochem. and 
Bidphys. Res. Comm. 7P0. 125-133). The -mature HBEGF polypeptides are 
heterogenous and range fix)m about 75-86 amino acids in length. HBEGFs have a 
molecular weight of approximately 19-23 kD, and have an isoelectric point between 

30 about 7.2-7.8. 

The effects of HBEGFs are mediated at least in part by receptor t>'rosine 
kinases on the cell surface membranes of HBEGF-responsive cells {see. e.g.. U.S. 
Patent Nos. 5,183,884 and 5,218,090; and Ullrich et al. (1984) Nature J0P;4 18-425, 
which are incorporated herein by reference). The EGF receptor proteins, which are 
35 single chain polypeptides with molecular weights of approximately 1 70 kD, depending 
on cell type, constimte a family of suucturally related EGF receptors. Cells that express 
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. the EOF receptors include, for example, smooth muscle cells, fibroblasts, keratinocytes, 
. r and numerous humari cancer cell lines; sudh'^^i^^^^^^^ (epidermoid); KB3-1 

(epidermoid); GQLO. 205 (colon)? GRIi l739 (gkstric);'HEP Q? Qiepatoma); LNCAP 
• ;.. (l>T,QS^t^);:uMCTr7]Z(bK^^ (br^aist)- Wcf''417D (iung)i MG63 

. 5 (p?t^sarcoma); vlW5a^:(gn6bl^dma); ^ (glioma); id SW^Vla 

rr ;^^W- Wadrto the heparan -sulfate prdtedgiyca^^ to internalize 

. i . boMpdiPQietiQSiyiathdrendocytic pathV^y^m^ cbhtribiife to internatlization of HBEGFs. 
. ; • / ;:: r For purposes herein; polypeptides that are re^^ 
receptor include any molecule that (1) includes a V^ceptor binding domain that is 
1 0, hom^ologous.tp EGF and that is stibstaintiaiiy^omblo^^^ (more homologous thap TGF- 
::, :Ct) tQi Sucb?.d.omaii&^'.i^ iwatoe HBEGFs hiaviiig^^ in 
SEQ ID Nos, ln5; and'(2)'react5 with receptors' bft^cells to wliich a native HBEGF la 
ynature HBEGF having an amino acid seqiieHee inclijiied in of SEQ ID Nps. 1-5) 
and results in Jntemalizatiorivof the linked agent: TTius,' the polyj^pt^^^^ are 
15 reactive v/ith a HBEGF receptor include members of ^'t^^ of 
, , polypeptides, miiteins of these polypeptides; chimeric or ^ molTCules that contain 

. : : .portions of any of these fadiiy members, and any portion thereof that W 
: ifi , : : . receptors ^d internalizes- a linked^ agent? Aiiy ^ixayp^ptide tliat has a heparin^binding 
' domain andfincludes ahl^^ refe^ptor bini[iing db^^ substant^lly homologous 

2Q ; (n^ore hpmplQgQus th^^ s^l ^&rih in any ' <^ is 

- ; . intended/pr use herein. iHBEG^F for use hereift also includes 'a^y i^ HBEGF 
polypeptide that retains the ability to bind to a HBEGF receptor and to be internalized 

. by a. cell bearing su^:h receptoti/« ' " - ^ l< . ^ -^i t ^ ' - 

... I.. .... j . '.■■^^r . . i / ■ 

r i' ^^.^r^^It ?isnllide^slood that mino variations including 

25 allelic- vanaiionsi^ species variations and cbhservativ^ Mriino acid substitutions " such as 
^ -^P?^.?et forthfin Jable 1, iri HBEGF tKatldb riot alter' its ability to bind tq HBEGF 
, receptors. an4 lo'lbe internalized by cells upon' siich binding are encompassed within the 
vfamilyjpP HBEGF polypeptides intended fo 

. J I i Mature human HBEGF as isolated has' Heterogenous amino acid lengths 
30 ranging from 75-86 (Higashiyama et ah (1992) Science 251:936-939). For example, 
.J yanpuS(isoXonTTS ;Qf matt^ HBEGF that have' variable N-termini, and include, but are 
not limited to^Xhose having N-termini corresponding to arnino acid positions 63, 73, 74. 
77 and 82 of the' precursor protein (see, e.g., SEQ ID Nos. 1 and 2, see, also SEQ ID 
No. 3, for- the presently preferi-ed fomi). A preferred HBEGF for use herein is the 77. - 
35 . amino acid form of human HBEGF beginning at amino abid 73 of the precurspr protein 
(SEQ ID No. 3, which correspohds to amino acids 73-149 of SEQ ID NOs.' 1 and 2; see 
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Example 4). Members of the HBEGF polypeptide family, including. SEQ ID NOs. 2-5, 
are particxilarly preferred. Modification of the polypeptide may be effected by any 
means known to those of skill in this art. The preferred methods herein rely on 
modification of DNA encoding the polypeptide and expression of the modified DNA. 
5 All of the HBEGF polypeptides induce mitogenic activity in a wide 

variety of cells, and this activity is mediated by binding tp an HBEGF cell surface 
receptor followed by internalization. Binding to a HBEGF receptor followed by 
internalization are the activities r?quii:ed for an HBEGF polypeptide to be suitable for 
use herein; mitogenic activity is not required. A test for binding and intemalis^ation 

10 activity is the ability of the HBEGF-loxin conjugates to kill EGF-reccptor containing 
cells' An exemplary method , for testing for such cytopathic activity is the Cell 
Proliferation/Cytotoxicity Assay described in ExanipSe 4. Any HBEGF polypeptide 
that possesses such ability is intended for use herein. 
2. Modifications of HBEGF .... 

15 If it is necessary or desired, the heterogeneity of preparations of HBEGF 

polypeptide-containing chemical conjugates can be reduced by modifying the HBEGF 
polypeptide by deleting or replacing a site{s) (that are non-essential for binding and 
internalization) on the HBEGF that cause the heterogeneity and/or by modifying the 
targeted agent Such sites are typically cysteine residues that» upon folding of the 

20 protein, remain available for interaction \yith other cysteines or for interaction with 
more than one cytotoxic molecule per molecule of HBEGF polypeptide, but that are not 
required for binding to HBEGF receptors and internalization. Such cysteine residues do 
not include any cysteine residue that are required for proper folding of the HBEGF 
polypeptide, or for retention of the ability to bind to a HBEGF receptor and internalize. 

25 For chemical conjugation, one cysteine rjesidue that, in physiological conditions, is 
available for interaction, is not replaced because it is used as the site for linking the 
cytotoxic moiety. The resulting modified HBEGF is conjugated with a single species of 
targeted agent, such as a RIP, antisense nucleic acid or therapeutic nucleic acid. . 

The contribution of each cysteine to the . ability to bind to HBEGF 

30 receptors may be determined empirically. Each cysteine residue may be systematically 
replaced with a conservative amino acid change or deleted. The resulting mutein is 
tested for the requisite biological activity, the ability to bind to HBEGF receptors and 
internalize linked targeted moieties. If the mutein retains this activity, then the cysteine 
residue is not required. Additional cysteines are systematically deleted and replaced 

35 and the resulting muteins are tested for activity. In this manner the minimum nimiber 
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■ and identity of the cysteines needed to retain the ability to bind to a HBEGF receptor 
arid internalize may be determined. ' . . - 

... . ' ; Polypeptide-may also be modified'by addition of one or 

: "^i^^J^^^Kf^ l^r^-^y^:^^ P-ferabiy tke N^erminus, in 

^ ^"^JX^''^""^ ^eriabl^ to chemical conjugation by providing a readily 

• availible W&sehtid^ HBEGF is modified herein by addition of Cys 

-corijbgation:-A^V;rt^^^^^^ replacement of one or 

more cysteine Residues that are not r^uired for binding to ,a HBEGP: receptor and 
10 mterrialization of tht targeted agent.' These modified forms of HBEGF are ^particularly 

■ suitable'fdi^chemi&ai cbnju^^^^ 

Mutation rri^y be effected by kny metliod'known to those of skill in the 
art, including site-specific or site-directed mut^enesis of DNA encoding the protein 
and the use of DNA amplification methods using primers to introduce and amplify 
' 15 - alterations in the DNA template, such ak nudleic acid ariiplification splicing by overlap 
extension (SOE): Site^specific mutagenesis is typically effected using a j^hage vector 
' that^has smgle^ and double-stranded fbfms,=such^^^ 
■■■■■^ U - known arid cohmiercially avaUaibie/^' Oth^ siiitable^^^^^^^ tK^t ^^ontain a single- 
■■■■■■ stranded phage ongiri" 'of rdplifcatibri^ r^i^^gg iis6d (^e. e!^r VeiraWaLll987) MsJiL 
C20 EnzxmeL^l^:3). ^=In geherair tiy'i;^aring a 

- ^ sirigfe^stranded vectbr'thm encodes the i,roteinWint^ry (i^^i^ HBEGF 
' ' . > ^-family- bf cj^btoxic mdiebffli^ suih as^ k saporin); ' An oligonifcleotide primer that 
' • edntams the' desired mutaf ion A^tiiiri aiegiciri of hbmology to the DNA in the single- 
' ' • stranded-vector is 'inriedled to the vector foHoSired by kddition of a DNA polymerase 
'25 'such as H CO// polymerase I Kletibw fragrii^ri^- which uses the double straiided region as 
• ^'- a primer-^^to produce r hetWodu^lex in strand encodes the aitered sequence 

^ - • and the bthfer the ofig'mai sequerice. The heterbduplex is intraluced into' appropriate 
bactenal cells and dories thahliciudis the dew^^^^^^^ 

- > altered DNA nioledyes Way be ^expressed recombinantlV in appropriate host cells to 
' 30 ■ prtfduce-the-ihodified p^^^ ' - ' • ■ 

the SOE riiethod uses two amplified bligoriiiclebtide products, which 
have cohipleriieritaiy ends as primers and which include an altered codon at^ tiie locus at 
- which thfe mutation is desired, to produce a hybrid product. A second airipiification 
• ' reaction that uses' two prinYers that anneal "at the non-overiapping end^^plify the 
35 hybrid to produce DNA that has the desired altc^tion. 
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Suitable conservative substitutions of amino acids are known to those of 
skill in this art and may be made generally without altering the biological activity of the 
resulting molecule. Those of skill in tliis art recognize that, in general, single amino 
acid substitutions in non-essential regions of a polypeptide do not substantially alter 
5 biological activity (see, e,g„ Watson et al. Molecular Biology of the Gene, 4th Edition, 
1987, The Benjamin/Cununings Pub. co., p.224). Such substitutions are preferably 
made in accordance with those set forth in TABLE 1 as follows: 

TABLE 1 

10 



Original residue 


Conservative substitution 


Ala(A) 


Gly; Scr 


Arg(R) 


Lys 


Asn(N) 


Gin; His 


Cys(C) 


Ser; neutral amino acids 


Gln(Q) 


Asn 


. . .GIu(E) 


Asp 


Gly(G) 


Ala; Pro 


. ,- His(H) 


Asn; Gin 




Leu;Val 


V Leu(L) 


ne;Val 


Lys(K) 


Arg: Glny; Glu 


Met(M) 


Leu; Tyr; He 


Phe(F) . 


Met; Leu; Tyr 


: Ser(S) 


Thr 


Thr(T) 


Ser 


Trp(W) 


Tyr 


Tyr(Y) 


Trp;Phe 


Val(V) 


He; Leu 



Other substitutions are also pemiissible and may be determined 
empirically or in accord with known conservative substitutions. Any such modification 
of the polypeptide may be effected by any means known to those of skill in this art. 
15 HBEGF polypeptides ihay be isolated by methods known to those of 

skill in the art or may be prepared by expression of DNA encoding a HBEGF 
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! polypeptide (jee, e.g:. International Application WO/92/06705 (and the corresponding 

• U.S. patent application serial No: 07/598,082), and Abrahani et al. (i993yBiochem. 
Biophy. Res. Coinm. 790:125-133 ibid SEQ ID NOs. 1-5 hereifir ' ' 
••■Bi •-'targeted agents- •' '''<••"••-- ^ " ' ' • ' ■ ■ 

-> 1. Cytotoxic agents 

" ^^ytotpxic agent refers to a inolecuie capable of inhiW^^^ cell function. 
Cytotoxic agents include any agent that, upon intemalization, by a eukaryotic cell, 
inhibits growth or proliferation of the cell, either by killing the cell or inhibiting a 
metabolic pathway, transcription, or translation such that cell proliferation slows or 
10 stops. Any agent that, when internalized- inhibits or destroys cell ■ growth, cell 
proiiferaVit>h or bihe^ essential cd fimctions is-suitat/e for use herein. Cytotoxic agents 
include ribosonie inactivating proteins, small nrietabolic inhibitors, antisense nucleic 
acids, toxic drugs,. such as. anticancer- agents, 'and small -molecules. Such as light 

activated porphyrins _ Ribospme inactivating proteins, -such as saj^rin. are the 

1 5 preferred cytptoxic protein agents for use herein dnd hucleic acids are-the preferred non- 

' peptide agents,;- . . _ . . . !.. • 

. . „iSuch cytotoxic agents, include, but are not limited to,-^porin, the ricins, 

•abrin and other RIPs, /'jeurfo/»o/?as exotoxin, littHibitors or protein 

synthesis, including antisense .nucleic„acids_an!ji otaeiL metate^^^ that are 

^^^*^''L*" this art. Saporin is prefeif^, but other suitable jlUPs include, 
'^^^ ^^ "pV*"^'*^'* 'i':^"' A cjiam, maiie RIP; gelonin,. diphtheria toxin and 
Idiphtheria toxin A chain (jree, e.g., U.S. Patent No. 4,675,382), trichosanthin, tritin, 
pokeweed; antiviral piiotein (PAP), mirabilis antiviral protein (MAP), Dianthins 32 and 
30, abrin, monordin, , bryodin, shiga, cytotoxically, active fragrnents of^cytoxins and 
25 others known to those of skill in this art {see, e.^.,.BaFbieri et al. (1982) Cancer Surveys 
' 7:489-520 and .European published patent application No. 0466 222, . incorporated 
herein by refereiice, which provide lists of numerous RIPs and their sources; see. also, 
;U:S. Patent Nb: 1248,608). " ' " . . " 

, The selected cytotoxic agent is, if necessary, derivatized to produce a 
30 group reactive with a cysteine oh the selected HBEGF." If derivatizatibn results in a 
nriixture of reactive spepi^s, a mono-deriyatized form of the .cytotoxic agent can be 
isolated arid then conjugated to the selected HBEGF. . 

2. Ribosome inactivating proteins ^ 

35 saporin, are plant I>roteins that catalytically^ iri^ctivate eukaryotic ribosomes. RIPs 
inactivate ribosomes by interfering with the protein elongation step of protein synthesis. 
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For example, the RIP saporin (hereinafter also referred to as SAP) has been shown to 
enzymatically inactivate 60S ribosomes by cleavage of the n-glycosidic bond of the 
. adenine at position 4324 in the rat 28S ribosomal RNA (rRNA). Some RIPs, such as 
the toxins abrin and ricin, contain two constituent chains: a celNbindmg chain that 
5 mediates binding to cell surface receptors and intemalizatron of the molecule; and an 
enzymatically active chain responsible for protein synthesis inhibitory activity. Such 
RIPs are type II RJPs. Other RIPs, such as the saporiiis, are single chains and are 
designated type I RIPs. Because such RIPs lack a cell-bitiding chain, they far less toxic 
to whole cells than the RIPs that have two chains. 

10 Several structurally related saporins have been isolated from seeds and 

leaves of the plant Saponaria officinalis (soapwort). Among these, SAP-6 is the most 
active and abundant, representing 7% of total seed proteins. Saporin is' very stable, has 
a high isoelectric, point, does, not contain carbohydrates, and is resistant denaturing 
agents, such as sodium dodecyl sulfate (SDS), and a variety of proteases. The amino 

15 acid sequences of several saporin-6 isoforms from seeds are known and there appear to 
be families of saporin RIPs differing in a few amino acid residues. Because saporin is a 
type I RIP, it does not possess a cell-binding chain. Consequently, its toxicity to whole 
cells is much lower than the other toxins, such as ricin and abrin. When internalized by 
eukaryotic cells, howiever, its cytotppcicity is 1 00- to 1 000-fold more potem thain ricin A 

20 chain. , 

Saporin is preferred herein. The sa{>orin polypeptides include any of the 
isoforms of saporin that may he isolated from Saponaria officinalis or related species or 
modified form that retain cytotoxic activity. $uch modified forms have amino acid 
substitutions, deletions, insertions or addition^ but Still express substantial ribosome- 

25 inactivating activity. Purified preparations of saporin are frequently observed to include 
several molecular isoforms of the protein. It is understood that differences in amino 
acid sequences^ can occiu; in saporin from different species as well as between saporin 
molecules from individual organisms of thc5 same species. In particular, such modified 
saporin may be produced by modifying the DNA encoding the protein {see, e.g.. 

30 published International PCX Application WO 93/25688 (Serial No. PCT/US93/05702), 
which is a continuation-in-part of United States Application Serial No. 07/901,71 8; see. 
also, copending U.S. Patent Application No.. 07/885,242 filed May 20, 1992, and Patent 
No. 1231914, granted in Italy on January 15, 1992) by altering one or more amino acids 
or deleting or inserting one or more ammo acids, such as a cysteine that may render it 

35 easier to conjugate to HBEQF or other cell surface binding protein. Any such protein, 
or portion thereof, that, when conjugated to HBEGF as described herein, exhibits 
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cytotoxicity in standard in vitro or in vivo assays within at least about an order of 
, . magnitude of the saporin conjugates described herein is contemplated for use herein. 
, -: T ^. r.-.v. . ^ Thus;- the SAP used herein includes protein that is isolated from 
; , . natinaJbsparces or that is produced by rectiinbihant 'e^^^ copending 
5 , , PuWisheti fotematiohal RCT Applteatfdh W 

y^hich.jis, ,a,.€d»|a^uationHn^part- of lUrtited States Af>plicati6n Serial No; 07/901,7 1 8,' 
,. ,fiIe4J,Mne:16;rl992;.see,al^ :L{ n v . : ; 

- --'i n.vSome.of the DNA-molisciles prov^de^ sap6nn that has 

substantially the same amino acid se^uehcfe ahd nbdsom^-ihacti vati^^ acti vity ks that of 
1,0 ... preferred .s^porin-6 (S0-6),4nclUding'any df fcfiir iiofdhrist Which have heterogeneity at 

ammo.acidpositioiis.487dnd.'9l.Hs&Ci'%:i=^ 
'• J?9Q, and Bai^ et .aLj. Bifftechnbli^APpL BCdchem. B:A^^i:^ \^C^ Patent 
> 2,216,891 B and EP Patent 893061X)6; and SEQ i£) NOS;8il2): Otiidr ^uit^itle 'Wpo^ 
polypeptides .include: other roetobcis^'of the' iniilti^«;iie ^tti'lj^ 'fcddiii^for Mmos of 
15 saporin-type ribosome-inactivating proteins ' including S6-l arid 'sb-3 (Fordham- 
:Skeltqnet aI..A/o/. Gen. Ge«er. 227:134-138; 19^), Soh ' (se'e-. e.gl'/uisfV^iJlicatioii 
Serial No. 07/885,242,.whiGh corresponds to dB 2,216,89iri^^^^^ Fbrdham-Skelton 
4 :,, fit.al., Mpl.:Cen. Ge«e/.^22P:460-466,a991)-SO^iyee; e^?GB'2;i94:^ also, 
; Lappi eX ^.i:Biochem:mi6phys: Res) GdOT/h^w#i^^7'2P?93 4-942;'^ e.g., 
GB 2,194,241 B; see, also, Montecucchi et al.. Int. J. Peptide Protein Res. 3^:263-26^, 

' ' ■ ' • - • ' ' saporin polypeptides' exemplified hereiri intlude those having 
1. substantially the isame amino acid sequence as those listed in SEQ ID NOS. 8-12. The 
• ^'solation .and. expression -.of the DNA' encoding these proteins ' is deSciibed in the 
25.. '.Examples. .•U.-:;: v'oA in:.' ,.v. :<-. t.';---' •- ■ 

r. . : . . . V >f r ; :CThe!sai)orin polypeptides' ihfclude ahy' of the isofofms of saporin that may 
. , ,be.isolated from Saponaria. officinal or related'specles ot modified forms that retain 
: / .icyjotoxic activity.; In .particular, such mcklified saporin m^y be produced by mbdifying 
;> the DNA-encoding the -protein- (jee, ' eg. , International PtT ' Application Serial No. 
30 rPCT/US93/05702.ifiled .ori- jurie 14, 1993. and United States Applifcatidii'Senal N6. 
.^,> 07/90,1^718; see, alsfp, copending U.S. Patent Application No. 07/885,242 fil^d May 20, 
K ^1992, and Italian Patent'N6.' i;231,914) by altering" ohe or more amino dcids or deleting 
or inserting one ormore amino acidsi Any such protein, or portion there6f,^hat exhibits 
'cytotoxicity in standard in 'vitro hr in v/vo assays 'within at least about an order of 
35 magnitude^of the sapori^- conjugates described herein is contemplated fdr use herein. 
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Nucleic acids encoding other ribosome*inactivating proteins 
andcytocides 

In addition to saporin discussed above, other cytocides that inhibit 
protein synthesis are useftil in the present invention. The ^ene sequences for these 
5 cytocides may be isolated by standard methods, such as PCR^ probe liybridization of 
genomic or cDNA libraries, antibody screenings of expression libraries, or obtain 
clones from commercial or other sources. The DNA sequences of many of these 
cytocides are well known, including ricin A chain (Genbank Accession No, X02388); 
maize ribosome-inactivating protein (Genbank Accession No. L26305); gelonin 
10 (Genbank Accession No. L12243; PCT Application WO 92/03155; U.S. Patent No. 
5,376,546; diphtheria toxin (Genbank Accession No. KOI 722); trichpsanthin (Genbank 
Accession No. M34858); tritin (Genbank Accession No. D 13795); pokeweed antiviral 
protein (Genbank Accession No. X78628); mirabilis antivira] protein (Genbank 
Accession No. D90347); dianthih '30 (denbank Accession No. X59260); abrin 
IS (Genbank Accession No. X55667); shiga (Genbank Accession No. Ml 943 7) and 
Pseudomonas exotoxin (Genbank Accession Nos. KO 1 397, M23348). 

DNA encoding SAP or any cytotoxic agent may be used in the 
recombinant methods provided herein. In instances in which the cytotoxic agent does 
not contain ^a Cysteine resiclue, such as instances in which DNA encoding. SAP is 
;20 selected, thd DNA' may be modified Id include a cysteine codon. The ^ codoa.may be 
inserted into any locus that does not reduce or reduces by less than about one order of 
magnitude the cytotoxicity of the Iresulting protein. Such locus may be determined 
empirically by modifying the protein and testing it for cytotoxicity ui ai;i assay, such as 
a cell-free protein synthesis assay. The preferred loci in SAP for insertion of the 
25 cysteine residue is at or near the N-termiriiis (within about 20 residues, preferably 10 
residues, of the N-terminus). 

3* Expression of cytotoxic agents 

Host organisms include those prgariisms in which recombinant 
pix>duction of heterologous proteins have been carried out and in which the cytotoxic 

30 agent, such as saporin is not toxic or of sufficiently low toxicity to permit expression 
before cell death. Presently preferred host organisms are strains of bacteria. Most 
preferred host organisms are strains of £. co//, partiqularly, BL21(DE3) cells (Novagen, 
Madison, WI). . f 

The DNA encoding the cytotoxic agent, such as saporin protein, is 

35 introduced into a plasmid in operative linkage to an appropriate promoter for expression 
of polypeptides in a selected host organism. The presently preferred saporin proteins 
are saporin proteins that have been modified by addition of a Cys residue or 
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: r^Iacemen. of a «„^senUai r«iduc a, or near an,ino- or carboxyl terminus of ,he 
saponn wd, Cys. Saporin, such as 0,a, <>f SEQ ID NO. » has been modified by 
■ • ..»«tton' of Me,^^ by 

- ■ U^U^-^^- '^y ^^^ pn„ein secretion 

:.:. r .:'T^'^°'^ -P,f^ ^""S fe 'o,?!'- Jhe man^ polypeptide >in,o the 
penplasm or culture medium, The .nrsuhing saponn protein can,be,purifled by methods 
routmely ,^1„ the art, including, metlK^ds d<!«ribe4^^ Examples 

10 and more preferably £ CO/,- ccfe as^eihofls applicable for, cltiring said cells 

T''^'™*." *'???"''^™ ^ See 
for example, Sambrook ei^ P^^„^,^ X^Wor. Cold 

Spnng Harbor Ubbratory PreK,,Coid , ., ^ . 

, ' „ . Jh^ DNA fonslruct encoding the! saporin piston; is mtfodUced into the 

15 host cell by any suitaWe mean^ including, butn^t limited t,,^^^ 

plastnids, bacterial phage vectors, transfectip^ electryporation, lipofection. and the like, 
me hetero ogous DNA cin optionally include ^„enpes, , 
.. .hat, allow for the extrichromosomal ma^^^^ =?porin^,aimngTlasmid, or 

ensure stable maintenance in the host). / . , r . , : 

T^f'^"'' pP^'i^^fe^W forthe sit^^ integration; Northem 

-^lots for mducible^promoter^pcmsiv^ s^ria gene, expression;. and product analysis 

forthe presence of saponn.c<;„taining proteins in either the cytop^^^ 
growth media. 

Once the sapbrini^ncoding PNA fiagment has been introduced into the 
' ' host cell, the desired saporin-containing protein is produced by subjecting the host cell 
, ■ I?,^"''"'''"' which the promoter is induced, whereby the operatively linked 

30 . ' DNA IS transcribed. Ih a preferred embodiment, such conditions are those that induce 
' expression from the £. i:on iac operon. the plasmid containing the DNA encoding the 
saponn-containing protein also includes the lac operator (O) region within the promoter 
and may also include the lac 1 gene encoding. the lac repressor protein (...e eg. 
Muller^Hill et al. (1968) Proc. N^il. Acad. Sci. USA 59:1259-12649). The lac repressor 
35 represses the expression from the lac prorhoter tintil induced by the addition of IPTG in 
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an amount sufficient to induce transcription of the DNA encoding the saporin- 
containing protein. » 

The expression of saporin in £. coli is, thus accomplished in a two-stage 
process. In the first stage, a culture of traiisformed E. coli cells is git>wn under 
5 conditions in which the expression of the saporin-containin^ |)rotein within the 
transforming plasmid, preferably encoding a saporin, such described in Example 4, is 
repressed by virtue of the lac repressor. In this stage cell density increases. When an 
optimum density is reached, the second stage commences by addition of IPTG, which 
prevents binding of repressor to the operator thereby inducing the lac promoter and 

1 0 transcription of the saporin-encoding DNA. 

In a preferred embodimefnt, the promoter is the T7 RNA polymerase 
promoter, which may be linked to the lac operator and the £1 coli hosi strain includes 
DNA encoding T7 RNA poiynierase opcrably linked to the lac operator and a promoter, 
preferably the lacUVS promoter. . The presently preferred plasmid is pET 11a 

15 (Novagen, Madison, WI), which contains the T71ac promoter, T7 terminator, the 
inducible E. coli lac operator, and the lac repressor gene. The plasmid pET 15b 
(Novagen, Madison, WI), which contains a His-Tag^^ leader sequence (Seq. ID NO. 
23) for use in purification with a His column and a thrombin cleavage site that permits 
cleavage following purification over the colunin, the T7-lac promoter region and the T7 

20 terminator, has been used herein for expression of saporin. Addition of IPTG induces 
expression of the T7 RNA polymerase and the T7 promoter, which is recognized by the 
T7 RNA polymerase. 

Transformed strains, which are of the desired phenotype and genotype, 
are grown in fermentors by suitable methods well known in the art. In the first, or 

25 growth st^e, expression hosts are cultured in defined minimal medium lacking the 
inducing condition, preferably IPTG. When grown in such conditions, heterologous 
gene expression is completely repressed, vAnch allows the generation of cell mass in the 
absence of heterologous protein expression. Subsequent to the period of growth under 
repression of heterologous gene expression, the inducer, preferably IPTQ,, is added to 

30 the fermentation broth, thereby inducing expression pf any DNA operatively {inked to 
an IPTG-responsive promoter (a promoter region that contains lac operator). This last 
stage is the induction stage. 

The resulting saporin-containing protein can be suitably isolated from 
the other fermentation products by methods routinely used in the art. e.^., using a 

35 suitable affinity column as described in the Examples; precipitation with anunoniiun 
sulfate; gel filtration; chromatography, preparative flat-bed iso-electric focusing; gel 
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electrophoresis, high performance liquid chromatography (HPLC); and the like. A 
method for isolating saporin is provided in Example 1 (see,' also Lappi et al. ((1985) 
,:.-o.Bio€hem,\Biqphvs..Res.. On^un^i '/2P:S?34.942)r. ' 'the-'ex^sed saporin protein is 
: .. isolated from -cithe^. the- -eytopiasm;' periplasm, -tir'"- W -celi' culW medium (see, 
■ - 5 . : discussion below and-Jce. e^^^^^^ Example 3 for preferred methods and skpcjrin proteins). 

. . ... v ' . vlPdrphyriiis We^^ w^^^^^^^^ knoWh^ liglit activatable toxins that can be readily 

• crbiss4inked: td'protbih^''-^^^^^^ es^., U.S. Patent Na' 5.257>70;' "U-'sf Patent No. 
- ' 5^252,720; U.S. Patent Na' 5,'238l940;' U.S: Paterlt No! 5.192;788; US. Patent No. 

10 5.171,749; U.S. Patent No. 5,149,708; U.S. Patent No! I2(i2:ii7; U.S. Patent No 
'-5;217;966; VSri^Mt^Uor^^SiMiU^'Pi^^^ U.S. Patent No 

5.087.636;=tJiS;: Patei^i No: 5^^8:59^;' U:S^P^nt 'Na 5,qW,349; US. Patent No. 
4;968,71 5; U.S.- Patent Nol^4,92b;i4^ andantematioriar Appi^ 

' • -'Porphyrias »fe- bottjiigated to proteins 'by^ direct, coyaiMt bonds using. 
15 for example, a carbodiimidb.^ Liiika^e^niay be'effe^iea by/treatment' of TO 

1 .ethyl-3-3-dirnethyl^ino propyl) carbo diiiiide in the presence of a reaction medium 
- ! such as DMSO. For other methods stee U.S. Patent No. 4><S8,1^i5. "The porphyrin 
^ -HBEGF conjugates' iiiay be administered- topically ^ ' Xctuatipii of the 

' ■ poiphyrin is''t>y irradiating i%hf tHo^^ of 
•-20'--'-'pofphyrin-type''ph6toseriSitizer.' ''H'-'" 

5. Nucleic acids for targeted delivery 

The conjugates provided herein are also designed to deliver nucleic acids 
■ to targeted ^etisl The nUcieic acids include those intended to deliver a cytotoxic signal 

• to a cell or to modify expression of g^^^^^^ ih'erieby e^ect genetic therapy; Examples 
.^^ nucleic acids iridude a^^ RNAJ DNA/ nb«)zym« that 

. P™^^'"s. ' The nucleic acids dan.also include RNA trafficking signals, such as viral 
"^ckagirig sequences (see. e.g^; Sullenger et al. (1994^ 5c/ence 2<52: 1566- 1569). The 
• ' du^lelc ^ids a^^^ hioiecules that encode intact genes or that" encode 

"proteins iisefiil for gene therapy or for effecting cell cytotoxicity. Especially of interest 
are Di^A' molecules that encode an enzyme that results in cell, death or fenders a cell 
suscepfiljle to tiell deaitK upoh the Addition of another product. For example, saporin is 
an enzyme that cleaves rRNA and inhibits protein synthesis. Other enzymes that inhibit 
pfotfein synthesis W especially' Wen M^^^ Other enzymes 

liiay be lised wheire the enzyme activates a compound with littie or no cytotoxicity into 
35- atomic product.' ' - = ■ - ' ' ' . <- 
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DNA (or RNA) that may be delivered to a cell to eflfect genetic therapy 
includes DNA that encodes tumor-specific cytotoxic molecules, such as tumor necrosis 
factor, viral antigens and other proteins to render a cell susceptible to anti-cancer agents, 
and DNA encoding genes, such, as the such as the defective gene (CFTR) associated 
5 with cystic fibrosis (see, e.^.. International Application WO 93/03709, which is based 
on U.S. Application Serial No. 07/745,900; and Riordan et aL (1989) Science 245:]066' 
1073), to replace defective genes. 

Nucleic acids and oligonucleotides for use as described herein can be 
synthesized by any method known to those of skill in this art (see, e.g.. Wo 93/01286, 

10 which is based on U.S. Application Serial No. 07/723,454; U.S.. Patent No. 5,218,088; 
U.S. Patent No. 5,175,269; U.S. Patent No: 5,109,124). Identification of 
oligonucleotides and ribo2ymes for use as antisense agents' as well as selection of DNA 
encodiiig genes for targjeted delivery for genetic therapy, is well within the skill in this 
art. For example, the desirable properties, lengths and^ other* characteristics of such 

15 oligonucleotides are well known. Antisense oligonucleotides are designed to resist 
degradation by endogenous nucleolytic enzymes and include, but are not limited to: 
phosphorothioate, methylphosphonate, sulfone, sulfate, ketyl, phosphorodithioate, 
phosphoraniidate, phosphate esters, and other such linkages {see, e.g., Agrwal et al. 
(1987) Tetrehedrqn LetL 2<S:3539-3542; Miller et al. (1971>y, v</w. Chem. Soc 93:6651- 

20 6665; Stec et al. (1985) Tetrehedron Lett. 2(5:2191-2194; Moody et al. (1989) Nucl 
Acids Res 72:4769-4782; Uznanski et al. (1989); NucL Acids Res. Letsinger et al. 
(19S4) Tetrahedron 40:137-143; Eckstein (1985) Annu. Rev. Biochem 5^:367-402; 
Eckstein (1989) Trends Biol Sci. 74:97.100; Stein (1989) In: Oligodeoxynucleotides, 
Antisense Inhibitors of Gene Expression, Cohen, Ed, Macmillan Press, London, pp. 97- 

25 117; Jagcrctal. (1988) 5/ocAemi5frv27:7237-7246).. 

a. Antisense nucleotides 

Antisense nucleotides are oligonucleotides that bind in a sequence- 
specific manner to nucleic acids, such as mRNA or DNA. When boimd to mRNA that 
has complementary sequences, antisense prevents translation of the mRNA (see, e.g. 

30 U.S. Patent No. 5,168,053 to Altman et al., U.S. Patent No. 5,190,931 to Iriouye, U.S. 
Patent No. 5,135,917 to Burch; U.S. Patent No. 5,087,617 to Smith and Clusel et al. 
(1993) Nucl Acids Res. 27:3405-3411, which describes dumbbfell antisense 
oligonucleotides). Triplex molecules refer to single DNA strands that bind duplex 
DNA forming a colinear triplex molecule and thereby prevent transcription (see, e.g^ 

35 U.S. Patent No. 5,176,996 to Hogan et al., which describes methods for making 
synthetic oligonucleotides. that bind to target sites on duplex DNA). 
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: • Particularly usefui antisense nucleotides and triplex molecules are 
molecules that are complementary^or bind to the seiike strand of DNA or mRNA that 
tr.. o encodes-an.oncogene, such as bFGFvint-2; hst.r^^ FGF-8. 
. , Other useM oligonucleotides include those tKat are specific Ibr lL-S (see, e,g,, 

, 5 :.:U.S.^ Patent .No^ (5;244,049; anii Intem^ibnal appli^^ WO 89/004836; WO 

. 9p/0632U WOl89Zie96i2pWO 90/Od5563r1md WO 91/6^83; a^ the corresponding 

U.S. applications for descriptions of DNA encodirig lL-8 kid aihihb acid 'sequ^^ of 
IL-8), which:, can he Jinked to bFGPi. for ^^ t^^ anti-sense 
,v oligonucleotides that are spedifie fo^noiimusfcle mydsm h^^ chain and)or c myb (see. 
V iP; ; Simons et al. C1992).a/'c*«esL 70:835-843^ WO^^^^ is l^ed on U.S. 

... application Serial iNbi' 07/723,454: X^Clerc^^ J: Ant Coll Cardiol. 17 

(2Suppl yty:105A; Ebbccke el ai:^(1952) Sa5ic7ce<r.^^^G^^^ ^7?585-59lX which can be 
targeted by an FGF to inhibit smooth miiscle cfell p^^^^^ siich as that following 

angioplasty , and - thereby prevent resteriosis or iifihibir viral' g^^^^ in 
15 transformed or infected cells.:!' ^ : a 

••*•■ ■ b. Ribozymes* ' * ' •" 
: i'.. A- ribotyme' is ari RNA mole^cule specifically cleaves RNA 

^0 : Ssubsp^tes; suchasTOes^OTget RNA;^^ 

\ ' f c expression. There are at least 'five classes ri^zyriies th^l are involved 

20 in thd cleavage 'ahd/6r ligaiti6h^of RNA ^h^ns:!^^^ tp any RNA 

'rs transcript and can* catalytically'^^cleaVe iuCh ti^ e.g., XIS, IJatent No. 

^ 5;272,262; U.S. Patent-No. ^5^i44;019; and li.k Patent Nos: 5,168,053, 5J 8 
' r 5,116,742 arid ^5,093,246 to C^ch et al., which described ribozymes and irn^^ for 
production thereof). Any such ribbzyme may be linked to the growth factor for delivery 
25 to HBEGF-receptor bearing cells. ' ' ' ' ' ^ 

The ribozymes may be deiivered to thie tai'geted cells, such as tumor cells 
that express, a receptor to which HBEGF Binds and Uik)n ^binding is internalized, as 
i ; ; DNA encoding the.ribozyiiife linked'fo a eukaryotic prom6tefi'such as a eukarybtic viral 
, promoter, generally a later promoter, such that upon introduction Mto' the nucleus, the 
30 ribozyme will be directly transcribed. In such instances,-the constnict Will also include 
: f a nuclear: translocation sequence (NTSV see Table 2, below), generally as p^ 
. : . growth factor or as part lof a linker between the growth factor and linked DNA. 
• ^ f c;- ■ 'Nndeica'dds encoding thersipe^^^^^^ 

: ;q Ambng the DNA'thdt encodes thera]:>e'utic products contemplated for use 
35 is DNA enc^oding correct Ibopies; of defective "genes, such as the defective gene (CFTR) 
associated with cystic fibrosis {see, e.g., Intemkion^l Application WO 93/^^ 
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is based on U.S. Application Serial No. 07/745.900; and Riordan et al. (1989) Science 
2^5:1066-1073), and anticancer agents, such as tumor necrosis factors; and cytotoxic 
agents, such as saporin. The conjugate should include an NTS. If the conjugate is 
designed such that the HBEGF and linked DNA is cleaved , in the cytoplasm; then the 
5 NTS should be included in a portion of, the linker that remains, bound to the DNA, so 
that, upon internalization, the conjugate will be trafficked to tbe nucleus. The nuclear 
u-anslocation sequence (NTS) may be a heterologous sequence or a may be derived 
from the selected growth factor. 

d. Other nucleic acids 

10 Extracellular protein binding oligonucleotides refer to oligonucleotides 

. that specifically bind to proteins. Sniall nucleotide molecules refer to nucleic acids that 
target a repeptor site. 

e. . Coupling of nucleic acids to proteins 

To effect chemical conjugation herein, the HBEGF protein is linked to 
15 the nucleic, acid either directly or via one or more linkers^. Methods for conjugating 
nucleic, acids^ at the 5' ends, 3' ends and elsewhere, to the amino and carboxyl termini 
and other sites in proteins are known to those of skill in the art (for a review see e.g., 
Goodchildv (1;993) In: Perspectives in Bioconjugate Chemistry^ Mears, Ed., American 
Chemical Society, Washington, D.C. pp. 77-99). For example, proteins have been 
20 linked to nucle?c acids ui;ing ultrrtviolet irradiation (Sperling et al. (1978) Nucleic Acids 
Res. 5:2755^2773; Fiser et al (1975) FEES Lett. 52:281-283), bifunctional chemicals 
(Baumert et al. (1978) £z/r. J. Biockem. «9:353-359; and Oste e; al. (1979) Mol Gen. 
Genet. 765:81-86) photochemical cross-linldng (Vanin et al. (1981) FEBS Lett. 72^:89- 
92; Rinke et al. (1980) J.MoLBioL 757:301014; Millon et al. (1980); £t/r. J. Biochem. 
25 770:485-454). 

In particular, the reagents (N-acetyl-N*-(p-glyoxylylbenzolyl) cystamine 
and 2-iminothiolane have been used to couple DNA to proteins, such as a 
2macroglobuIin (a2M) via mixed disulfide formation (see, Cheng et al. (1983) Nucleic 
Acids Res. 77:659-669). N-acetyl-N-(p-glyoxylylbenzo!yl)cystamine reacts specifically 

30 with nonpaired guanine residues and^ upon reduction, generates a free sulfhydryl group. 
2-lminothiolane reacts with proteins to generate sulfhydryl groups that are then 
conjugated to the derivatized DNA by an intermolecular disulfide interchange reaction. 
Any linkage may be used provided that, upon internalization of the conjugate the 
targeted nucleic acid is aaive. Thus, it is expected that cleavage of the linkage may be 

35 necessary, although it is contemplated that for some reagents, such as DNA encoding 
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^ ribozymes linked to promoters or DNA encoding therapeutic agents for delivery to the 
■ nucleus, such cleavage may not be necessary. ^ ■ 
^: • - 1 . )! Thiol' linkages caii be readily fofnied usiiig heterobi functional reagents. 
■Amine&ohave riMSoi-^bejen sJttac^^^^^ -to W t^rihinail '5* phosphate of ' uiiprotected 
5/ 'Oligohucleotid^s w^tieleicacias^m aqueous sc^lutidn^ by re^^ with 
:.. . ' - ^a .wateri-soluble^^darboaiiMMfe. 'such 'tis j-fetKyi:3'[3^tethy 

ho io5(EDG)ii or^ Niethyi'iN'<3-diifl6t^i'ylaiiiinbpf^^^^ in 
imidazole buffer at pH 6 to produce the S'phosphbriiinidizbiide: Contacting the 
S'phosphorimidazolide with amineHJohtiiining moleciiles, such as HBEGF, and 
10 ^ethylenediaminevresults in stable phdsp^^ et al. (1983) Nucleic 

Mds^Res^ /A:6513-6529;!and:: WO 88/05077 in' wMch the'U.S/is 'd^^^^ In 
particular, a solution of DNA is saturated with EDC, at pH S 'and iriciibated with 
agitation at 4° C ovemightv . TheiresuUing soliition is then buffered to pH 8.5 by adding, 
for example about- 3 jvolumes of . 1 00 mM citrate buffer: and addihg about 5 ng - 20 ^g 
.15 .of an HBEGF;. and agitating; the resulting mixture at 4° C for'dbbut 48 hoia^. The 
unreacted protein may be removed from the riiixture by eolumh chromatography using, 
for example, Sephadex G 75 (Pharmacia) using 0.1 M^ aihmdhium carbonate' solution, 
.-,^v;%-..PH 7.0 as an eluting tmffef; The> isolated conjugate iflay be lydpftiliysd and 'Stored until 
. used. -I •: f -v D.C , r 07i. ■ ■ : "iSi; W lr.:>n:iS: 

2P; . r. ; r :■ US. Patent No. 5237,016 providels methods^ for preparing'- hii^ 
- that are.bromacetylate:d: at tiieir 5Vterinirti and reacting the resulting' bligbnucieotides 
,■ with thiol groups. Oligonucleotides derivatized at their 5'-termini brbmoacetyr groups 
...can, be prepared' by reacting 5'-amihohexyl-phdsphoramidate oligoniicleotides with 
,. ,, brompacetic acid-N-hydroxysiicbinimidie " ester : as described iii U.S. ' Patent No. 
25 5,237,016. U.S. Patent No. 5,237,016 also describes methods for preparing tiiibl- 
. • derivatized, nucleotides,, which, can then: be reacted with thiol; groups on the selected 
growth factor.. :Briefly; thiol-derivatized nucleotides are prepared using a 5'-phosphory- 
lated nucleotide in two steps: (1) reaction of the phosphate group with imidazole in the 
presenpe of a diimide^and displacement of the imidazole leaving group with cystamine 
30. .fn pne reactipn'tstep;.jand,. reduction of the disulfide bond of the cystamine linker with 
r, : dithio^itpl (see, also, Orgel .et aL (1986) A^mc/. ^ci<fe..Re^. . 7^:651, which describes a 
- -similar procedure).. The 5'-phosphorylated starting oligonucleotides cart be prep^-ed by 
methods known to tiipse.of skill in the art isee;.e.g., Maniatis et al. (1982) Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory; New York, p. 1 22). 
35 . ' .' The antisense oligomer or nucleic acid, such as a methylphosphonate 
oligonucleotide (MP-oligomer), may be derivatized by reaction with SPDP or SMPB. 
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The resulting MP-oligomer may be purified by HPLC and then coupled to HBEGF, 
which may be modified by replacement of one or more non-essential cysteine residues, 
as described above. The MP-oligomer (about 0.1 p.M) is dissolved in about 40-50 |il of 
1:1 acetonitrile/water to which phosphate buffer (pH 7,5, final concentration 0.1 M) and 
5 a 1 mg MP-oligomer in about 1 ml phosphate buffered saline is added. The reaction is 
allowed to proceed for about S-10 hours at room temperatiu-e and is then quenched with 
about 15 tiL 0.1 iodoacetamide. The HBEGF-oligonucleotide conjugates can be 
purified on heparin sepharose Hi Trap colunms (1 ml, Pharmacia) and eluted with a 
linear or step gradient. The conjugate should elute in 0.6 M NaCL 

10 f* Nucleic acids encoding cytocides : 

A cytocide-encoding agent is a nucleic acid molecule (DNA or RNA) 
that, up)on internalization by a celL and subsequent transcription and/or translation into 
a cytocidal agent, is cytotoxic to a cdl or inhibits cell growth by inhibitinjg protein 
synthesis. 1 > ^ 

15 Cytpcides include saporin. the ricins, abrin and other ribosome- 

inactivating proteins, Pseudomonas exotoxin, diptheria toxin, angiogenin, tritin, 
dianthins 32 and 30, momordin, pokeweed antiviral protein, mirabilis antiviral protein, 
bryodin. ar^giogemn, and shiga exotoxin, as- well as other cytocides that are known to 

. , those pfskjll in the art, 

20 . ; Especially of interest are DNA molecules that encode an enzyme that 

results in cqII death or renders a cell susceptible to cell death upon the addition of 
another product. For example, saporin, a preferred cytocide, is an enzyme that cleaves 
rRN A and inhibits protein synthesis. Other enzymes that inhibit protein synthesis are 
. especially well suited for use in the present invention. In addition, enzymes may be 

25 used where the enzyme activates a compound with little or no cytotoxicity into a toxic 
product that inhibits protein sjTithesis. 

In addition to saporin discussed above, other cytocides that inhibit 
protein synthesis are useful in the present invention. The gene sequences for these 
cNiocides may be isolated by standard methods, such as PCR, probe hybridization of 

30 genomic or cDNA libraries^ antibody screenings of expression libraries, or obtain 
clones from conunercial or other sources. The DNA sequences of many of these 
c>iocides are well known, including ricin A chain (Genbiank Accession No. X02388); 
maize ribosome-inactivating protein (Genbank Accession No. L26305); gelonin 
(Genbank Accession No. L12243; PCT Application WO 92/03155; U.S. Patent No. 

35 5,376,546; diphtheria toxin (Genbank Accession No. KOI 722); trichosanthin (Genbank 
Accession No. M34858); tritin (Genbank Accession No. D13795); pokeweed antiviral 
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.> :prote.n..(Genbank Accession No. X78628); mirabllis antiviral protein (Genbank 
. Accession : No. . D.90347); .dianthin :30 (Genbank Accession No. X59260) abrin 
r . (Genbank Accession. :No;i.X55667); shiga (Genbank^ Accession Nb. Ml 9437) and 
:Pseudomonas exotoxin (Geribaiik Accession Nos. K0^397; M23348). " ' " 
•'••'•i»''in-i>»«'<ase. of' 6ytotociaai:niolecUles subfr as' the ^ 
V ^ .proteins.. yery-:fe.w.inc^ecules may hfeed be preS^^^ only a single 

. ...molecule of diiAtherraiito?coid iritit»diiced into^ a was sufficient to kill th^ cell In 
- other cases.nit -anay bb.that prorogation or stable maintenance- of the cdristruct is 
necessary to attaint sufficient- ntimbers or-cdncehtfations of the gene product for 
10 effective gene therapy. .Examples of replicating ihd stable eukaryotic plasmids are 
found.inthescientifi(^Jiferature.'\ . rt .-.n u- ''■ ■■ ..r. • > 

, - In general; constructs will also- contain elemehis^ necessary for 
transcription^ and translation. ; If the cytoeide^encoding agent is .DNA; then it must 
contam a promoter. The choice of the promoter will depend upon the cell type to be 
15 transformed and the degree or type of control desired. ' Promoters can be constitutive or 
act! ve many cell type,r tissue, specific, cell spetificvevemspeeific. of i^^^ Cell- 
- - type specific promoters and event type specific, promoters are preferred: Ex^ples of 
, . .Qonstitutive or nonspecific; promoters.include>the SV40 early prbmdter (U.S : Patem No. 
5,118,627). the SV40 late promoter (U.S. Patem No. 5J r8;627)j CMV -feafly gene 
20, .proipoter (U.S, Patent No. 5,168,&62),i and aden^virtis promoterl In addition to viral 
, . :KPmoteK,,eellul:ar,promo.ters are: also ameiteble^ within the confekt of this invention. In 
., yparU.cular.-cellu!ar,promQters for theso-called housekeeping genes'^e useful: - ' 
• ■ • . .Tissueispecificrprpmoters are particularly useful when a particular tissue 

. type is to.ibe targeted fbr transformation.rK3y',ising one of this cl^s of promoters, an 
2^.., extra margin of.=specificity can be attained.^Eor ex^le; when the indication to be 
treated is ophthalmological, either the alpha.crystalline promoter or gamma-crystalline 
: promoteriis preferred. When> a tumor is the target of gene delivery, cellular promoters 
. for SR^ifiq,tumor,inarkers or promoters more active in tumor cells should be chosen. 

.Th.us, ,to.=transform prostate tumor cells; the -prostate-specific antigdi promoter is 
30.. r especially, useful-. sSimilarly.^the tyrosinase = promdter 6r tyrosinase^related protein 
. promoter is a preferred promoter for melanoma treatmem. Fttr B lymphocytes, the 
. -iminunoglpbulin variable region gene promoter, for T lymphocytes, the TCR receptor 
... variable:.regipn<promoter, for helper.T lymphocytes, the CD4 promoter, for liver, the 
albumin prompter,; are but a few examples' of tissue specific promoters. In certain 
35 applications, such as treatment of restenosis, a promoter for myosin li^ht chain specific 
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for smooth muscle cells is preferred. Many other examples of tissue specific promoters 
are readily available to one skilled in the art. 

Inducible promoters niay also be used. These promoters include the 
MMTV LTR (I^CT WQ 91/13160), which is inducible by dexamethasone, 
5 metallpthionein, which is. induciblcf by heavy metals, and; promoters with cAMP 
response elements, which are inducible by cAMP. By using, an inducible promoter, the 
nucleic acid may be delivered to a cell and \yill remain quiescent until the addition of 
the inducer. This allows further control on the timing of production of the therapeutic 
gene. 

iO . Event-type specific promoters are active only upon the occurrence of an 

event, Sfuch as tumorigenecity or viral infection. The HIV LTR is a well known 
example of an event-specific promoter. The promoter is inactive uiilcii^ the iai gene 
product is present, which occurs upon viral infection.^ ; 

Additionally, promoters that are coordinately regulated with a particular 

15 cellular gene may be used. For example, promoters of genes that are coordinately 
expressed when a particular HBEGF receptor gene is expressed may be used. Then, the 
nucleic acid will be transcribed when the HBEGF receptor is expressed. Tliis type of 
promoter is especially useful when one knows the pattern of HBEGF receptor 
expression in a particular tissiie, so that specific cells within that tissue may be killed 

20 upon transcription of a cytotoxic agent gene ^vithout affecting the surrounding tisisues. 

Alternatively, cytocide gene products may be. noncj^otoxic but activate a 
compound, which is endogenously produced or exogenously applied, from a nontoxic 
form to a toxic product that inhibits protein synthesis. — 

The construct must contain the sequence that binds to the nucleic acid 

25 binding domain^ if the domain binds in a sequence specific manner. As described 
below, the target nucleotide sequence may be contained ^^dthin the coding region of the 
cytocide, in which case, no additional sequence need be incorporated. It may be 
. desirable to have multiple copies of target sequence. If the target sequence is coding 
sequence, the additional copies must be located in non-coding regions of the cytocide- 

30 encoding agent. The target sequences of the nucleic acid binding domains are typically 
generally known. The target sequence may be readily determined, . ia any case. 
Techniques are generally available for establishing the target sequence (e.g., see PCT 
Application WO 92/05285 and U.S. Serial No. 586,769). 

Specificity of delivery is achieved by coupling a nucleic acid binding 

35 domain to a receptor-binding internalized ligand, either by chemical conjugation or by 
constructing a fusion protein. Linkers as described above may be used. The receptor- 
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. binding internalized ligand part confers specificity of delivery in a celHspecific manner. 

The choice of the receptor-binding internalized lig^id to us^ will depend upon the 
■J .receptor expressed.by the- target cell's, '^e receptor tyjje oftlie target cell population 
. -:may^be.determiii^d -by:.^^^ techniques iiieh as iitibody staihiiig, PGR of 

.5 QPNA;«5ing>receptor.speeific primeifs, and l>ioc6dni^ 'dr ftmcti tfh^ ' rci^b^tor binding 
,.: . :assay^..:.I,tr)i&.preferable that the^ fec^tor be'^ceil type spfe6if,c or havb increased 
: ; .expressiOn-orr activity. </,e,.highef^^,^^^ the'target cell 

;population. liO;: VI.- : T.n I'.- .,, «::■.; ■.- • 

The nucleic acid binding domain can be of two types, non-specific in its 

10 ability to bind nucleic acid-or- highly specific st^'thktthe almihb'atid residues bind only 

the .desired nucleic.acid sequenced' 'Nonspecific W or 

. . compounds^ generally poly cations or highly basic:"Lys and /^g are' the ^ basic of 

the 20 common amino acidsl prOtfeins enriched fbf theW residues for 

nucleic acid binding domains-^Examples of basic pbteins incliide W protamines, 

15. and repeating, units of lysine and arginirie. ' PolyiL-lysihe is a acid 

binding:domain (5ee U.S. Patent Nos. 5vl 66,320 iid 5;354,84^^^^ Oth^r polycations! 

- o .<: ,;such as spermine and spermidine, may also be used' to bind hucleic acids. By way of 

; .. q example;the sequence-specific proteins including Sp-K Ai>ir, mybD aiid'the rev gene 

t.n.:.!productfrom.HIVimaybeuse^: Specific 'niitl^icabidbiiidiridby^ 

20,tx^andem,^individually^or multiply tc) a desired r^^ 

ligand of; interest; Alternatively, the domains caH be^ chemicaily conjugated to each 
-other..'.' .,•-<;:;:... -\ i;i'-.';-< -j...- '^i;'- ; . . , >:. .>•■ - J 

The corresponding responses elements 'that bind sequence-specific 
' domains are incorporated ^irito^he cohstruct'tb be'deliver^d. Complexing the cytocidal- 
25. encodihg agent to the receptor-binding inteirnalizdd ligand^niicieic^acid iiiiding domain 
• allovvs specific binding of response element to the liucleic acid bindfrig dbriiairi. Even 

.greater specificity, of binding may be achieved by identifying' and uiing 'the minimal 
i; . ^ amino: acid sequence that binds to the xytocidal-encodihg = agent of interestr For 

example, phage display methods can be used to identify amino acids residue^ of varying 
30. length that will bind to specific hucleic acid sequences with high Affinity.' (Si^ vM. 
... .. Patent; :No: 5,223,409.) The peptide sequence can- "then be cloned" irito the ieceptor- 

binding .internalized ligand as a single copy or multiple copies. Altemativdl'y, the 

peptide may be chemically conjugated to the rec^tor-binding internalized ligand. 

Incubation of tlie-cytocide-enctoding agent with the conjugated proteins will result in a 
35 specific binding between the two. " - '• ' 3 • ' ' 
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These complexes may be used to deliver nucleic acids that encode 
saporin or other cytocidal proteins into cells that have appropriate receptors that are 
expressed, over-expressed or more active in internalization upon binding. The cytocide 
gene is cloned downstream of a mammalian promoter such as SV40, GMV, TK or 
5 Adenovirus promoter. As described above, promoters of interest may be active in any 
cell type, active only in a tissue-specific manner, such as a-crystalline or tyrosinase, 
event specific or inducible, such as the MMTV LTR. 

Receptor-binding internalized ligands are prepared as discussed by any 
suitable method, including recombinant DNA technology, isolation fi"om a suitable 

10 source, purchase fi-om a commercial source, or chemical sj'nthesis. The selected linker 
or linkers is (are) linked to the receptor-binding internalized ligands by chemical 
reaction, generally relying on an available tliiol or amine group on the receptor-binding 
internalized ligands. Heterobifunctional. linkers are particularly suited for chemical 
conjugation. Alternatively, if the linker is a peptidejinker, then the receptor-binding 

15 internalized ligands, linker and nucleic . acid binding domain can be expressed 
recombinantly as a fusion protein. 

HBEGF may be isolated from a suitable source or may be produced 
using recombinant DNA methodology, discussed below. To effect chemical 
conjugation herein, the groAvth factor protein is conjugated generally via a reactive 

20 amine group or thiol group to the nucleic acid binding domain directly or tte-ough a 
linker to the nucleic acid binding domain.. The growth factor protein is conjugated 
either via its N-terminus. C-terminus, or elsewhere in the polypeptide. In preferred 
embodiments, the growth factor protein is conjugated via a reactive cysteine residue to 
the linker or to the nucleic acid binding domain. The growth factor can also be 

25 modified by addition of a cysteine residue, either by replacing a residue or by inserting 
the cysteine, at or near the amino or carboxyl terminus, within about 20, preferably 10 
residues from either end, and preferably at or near 4he amino terminus. 

In certain embodiments, the heterogeneity of preparations rtay be 
reduced by mutagenizing the growth factor protein to replace reactive cysteines, 

30 leaving, preferably, only one available cysteine for reaction. The growth factor protein 
is modified by deleting or replacing a site(s) on the growth factor that causes the 
heterogeneity. Such sites arc typically cysteine residues that, upon folding of the 
protein, remain available for interaction with other cysteines or for interaction with 
more than one cytotoxic molecule per molecule of heparin-binding growth factor 

35 peptide. Thus, such cysteine residues do not include any cysteine residue that are 
required for proper folding of the .growth factor or for retention of the ability to bind to 
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. a growth factor receptor and internalize. For chexnical Conjugation, one cysteine residue 
that in physiological conditions, is available for interaction; is not replaced because it is 
used:as^ti,e=site for linking the cytotoxic moiety. TTie resulting modified heparin-binding 
gmwth ^<»r>is.c<«ju8ated--Witha siAistlel^ie^ of ^dtbxic conjugate.' ' 
''^''v.t>r:r^ ' "^'^^'^^^^^ «>n»^t>utioh of each cysteine to the ability to bind to 
. , HBEGF .receptprs.:may be determined ' feriipirically: E^ch. cysteine residue may be 
systematically replaced with a conser^.ative ariiinb acid change (see table 1, above) or 
deleted. Jhe resulting mutein is tested Yor the requisite biological activity: the ability to 
^ ^ bmd to .^owth. factor i^ceptors arid internalize liniced^Wcleic acid binding domain and 
10, . agents. If thc^mutein stains this activity; thel, the cysteine residue is not required 
Addaronal cysteines are:systematically deleted-^d^re^^^^^ ^d the resulting muteins 
are te^ed for activity.. Each of the r^m^kirig cystine re,id „.ay^bc systemaucaiiy 
deleted and/or replaced by a serine i^sidue' iir 6ther r^sidu^ that ^^o^^ld not be expected 
^ to -alter the structure of the -proteiri. ^ The resulting peptid^; is tested ' for biological 
15 activity. If the cysteine residue is neces.^ for retention of biological activity it is not 
deleted; ,f ,t not necessary, then it is preferably replaced With a serine or other residue- 
. : that should not alter the secondary structure 6f the resulting protein. In this mamrer the 
i: : ^immum number and identity of th^ cyst^in^^^^i^d to reU ihe>,iiJty tc^Wnd to a 
: bepann-binding growth f^rr.ccpt^m mhMe'M^^^^ It is noted ' 

2P,, iK,wev,r, :that.modified or^rhutatit^fe^ gro*^%ciors maj Sibit r^duc^d ' 

: : Qr no:prolifeative activity/but^mab^ be siiit^bfe for is6 h^^lin:^ if^^/retain the ^ability 
. . ^ t^get a.IinHed cytotoxic agent tb'celis b^i^g receptor^ to which the unmodified 
.hepann-brnding ,^wth factor birids^^d^ fi^ft in irite^ali^ti^^oF th,%totoxic 
: moiety,- - \ - f>:i, ••'.u^n'>(.^ rc-roiuj . v ;j 'j.j '..I o *: J 

' • ' ■ * '^^''^''^^^^^^^^^ 

up to all cysteines iriUhfe HBEGF polyp^^rtide th^ ^ nbf recuireci for biologicaj 
activity can be deleted of replaced: Alterihativeiy; for 'irs^ In ri,e dremkai conjugation 
, methods herein, all except ' one t>f these cysteines: Vvhich will be' used for chemical 
conjugation to the < cytotoxic* ageht; c^ be deleted oVrepldb^d. Each o} the HBEGF 
30 polypeptides described herein have six cysteine rtsidtfes. E^^h of th^ six cysteines may ' 
. independently , be replaced and the resulting 'mutein Wed foV the abilitv to bind to 
HBEGF receptors and to be internalized. Alternatively, the^resulting mmein-encoding 
DNA IS used as part of a construct containing DNA encoding the nucleic acid binding 
domainJinked to the HBEGF^encoding DNA. The construct is expressed in a suitable 
35 host cell and the resulting protein tested for the kbility to bind to HBfeGF receptor 
internalize;.. As long as this ability is retained th"^ iiiuteih is suitable for use hereiii ' 
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The HBEGF monomers are preferably linked via non-essential cysteine 
residues to the linkers or to the targeted agent. HBEGF that has been modified by 
introduction of a Cys residue at or near one terminus, preferably the N-terminus is 
preferred for use in chemical conjugation. Methods for coupling proteins to the linkers, 
5 such as the heterobifunctional agents, or to nucleic, acids^ or to proteins are known to 
those of skill in the art and are also described herein. 

Methods for chemical conjugation of proteins are known to those of skill 
in the art. The preferred methods for chemical conjugation depend on the selected 
components, but preferably rely on disulfide bond ; formation. To effect chemical 
10 conjugation herein, the HBEGF polypeptide is linked, via one or more selected linkers 
or directly to the nucleic acid binding domain. . - 

A nucleic ac^d binding domain is prepared for chemical conjugation. For 
chemical conjugation, a nucleic acid binding domain may be derivatized with SPDP or 
other suitable chemicals. If the binding domain, does not have a Cys residue available 
1 5 for reaction, one can be either inserted or substituted for another amino acid. If desired, 
mono-derivatized species may be isolated, essentially as described. 

For chemical conjugation, the nucleic acid binding domain may be 
derivatized or modified such that it includes a cysteine residue for conjugation to the 
receptor-binding internalized ligand. , Typically, derivatization proceeds by reaction 
20 with SPpP. This results in a heterogeneous population. For example, nucleic acid 
binding domain that is derivatized by SPDP to a level of 0.9 moles pyridine-disulfide 
per mole of nucleic -acid binding domain, includes a population of nonrderivatized, 
mono-derivatized and di-derivatized SAP. Nucleic acid binding domain proteins, which 
are overly derivatized with SPDP, may lose ability to bind nucleic acid because of 
25 reaction with sensitive lysines (Lambert et al.. Cancer Treat Res. 57:175-209, 1988). 
The quantity of nonrderivatized nucleic acid binding domain in. the preparation of the 
non-purified material can be difficult to judge and this may lead to errors in being able 
to estimate the correct proportion of derivatized nucleic acid binding domain to add to 
- the reaction mixture. 

30 Because of the removal of a negative charge by the reaction of SPDP 

with lysine, the three species, however, have a charge difference. The .methods herein 
rely on this charge difference for purification of mono-derivatized nucleic acid, binding 
domain by Mono-S cation exchange chromatography. The use of purified mono- 
derivatized nucleic acid binding domain has distinct advantages over the non-purified 

35 material. The amoiint of receptor-binding internalized ligand that can react with nucleic 
acid binding domain is limited to one molecule with the mono-derivatized material, and 
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it is seen in the results presented herein thiat a more homogeneous conjugate is 
produced;- There may still be sources of heterogeneity withi the mono-derivatized 

- - hudeid aeid binding domain used here' but is accbjim^^ long as' binding to the 

'eytockiei^ebdirigajg^ ' ' " ' ■ ^ ^- v^-" i 

" . 5. / • . ' r ;\/:?. n BS^aiise^bre than bne amino giroup'ok &e ftucieic atid binding domain 

may react with the succinimidyl moiety; if is posisible thiat niWe tiiW one am group 

bh'th(^ siMace^b^'tt^ iTiis' fcfe^tes ^len^^ 

' mdno-derivatijted riuclei16' acid biriding domm^^^ As an altematiive to iicrivatizing to 
' - introduce a' sulihydryl^'^hfe" nUcteic- aiciy bindiBn^ dbmaih carf be ' mwH the 
10' ihtroductibh of a cystein^ rfesidu^/'^PfefOT 

include the N-terminus region, pref(^bly vWthiri'kbc^^ twenty residues from the 

- N terminus ^ of the nucleic add' bihdm^ dbmldihf •'ttsing either methodology (reacting 
mbno-deri vatized nuclfeic sacid Inht^ing dbnriain or inti-odUcing a Cys residue into nucleic 
acid binding domain), tfie ' resulting preparation of chemical "^co^^^ are 

15 ; monogehous;= eomposiiiohs contaihirig the conjugates also' appear to be free of 
aggregates. As a preferred alternative, heteteg^nei^ cah^ 1^^^ by producing a 

: fusion protein of receptor-bindihg iriternMized ligahd and riucleic iacid binding domain, 
^'''as-deseri'bed beloiv:-^'^' - ^'^-•^-'y ''^j"^- • ^^^'-'-o'Hf: rr-.: . 

• : 01 vo - • Exprb^sibri bf DN 

20 -ligaiid pblypisptidS linked to - thfe^ bihiding^'Hon^m^ results ' iri a more 

homogeneous pfep^atibri of cy^^ Aggregate fo^^^ 

in prepiafations cbhtainmg the ftisioh proteihs by' modifying the receptor binding 
internalized ligahd; such to b of nbnesSential cys^^^^ auid/or the nucleic 

acid binding domain to prevent iiriteiifctioiis betweeh conjugk^ free cysteines. 

25 " DMA ehcodiiig the pblypefitides^ m^^ be 'isblattedr synthesized or 

obtained? from conunercial isoiixtes dr prepared as described herein* ' Expression of 
recombinant polypeptides may be perfbrmed described herein; iand " DN A "encoding 
' these jjolypeptidbs may be lised as'the s^Sairting mkteria^^ for thie niethiods herein. ' 

As described above, DNA encoding HBEGF are described above. DNA 

30 may be prepared %ritheucally - base^ bn^ the amino acid or "DNA sequence or may be 

" ' isolated " using methodi^^lotowii to those ' of skill in the; aft; such as PGR, probe 
hybridization of libi^es,'and the like or bbtained from conunerciarbr other sources. 

As described herein, such DNA may then be mutageni zed using standard 
methodologies to ' delete = or replace any c^^ residues that are responsible for 

35 - aggregate forination. If necessary, the identity of cysteine residues that contribute to 

• aggregate formation may be determined empirically, by deleting and/or replacing a 
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cysteine residue and ascertaining whether the resulting giowth factor witli the deleted 
cysteine forms aggregates in solutions containing physiologically acceptable buffers 
and salts. Loci for insertion of cysteine residues mzy also be determined empirically. 
Generally, regions at or near (within 20,. preferably 10 amino acids) the C- or, 
preferably, the N-terminus are preferred. 

The DNA construct encoding the fusion protein can be inserted into a 
plasmid and expressed in a selected host, as described above, to produce a recombinant 
receptor-binding internalized ligand—nucleic acid binding domain conjiigate. Multiple 
copies of the chimera can be inserted into a single plasmid in operative linkage with one 
promoter. When expressed, the resuhing protein will then be a multimer. Typically, 
two to six copies of the chimera are inserted, preferably in a head to tail fashion, into 
cneplasmid. 

To produce monogenous presparations of fusion protein, HBEGF DNA is 
modified so that, upon expression, the resuHing HBEGF portion of the fusion protein 

1 5 does hot include any cysteines available for reaction. In preferred embodiments, DNA 
encoding an HBEGF polypeptide is linked to DNA encoding a nucleic acid binding 
domain. The DNA encoding the HBEGF polypeptide or other receptor-binding 
internalized ligand is modified in order to remove the translation stop codon and other 
transcriptional or translational step signals tha'. may be present and to remove or replace 

20 DNA encoding the available cysteines. The DNA is then ligated to the DNA encoding 
the nucleic acid binding domain polypeptide directly or via a linker region of one or 
more codons between the first codon of the nucleic acid binding domain and the last 
codon of the HBEGF. The size of the linker region may be any length as long as the 
resulting conjugate binds and is internalized by a target cell. Presently, spacer regions 

25 of from about one to about seventy-five to ninety codons arie preferred. The order of the 
receptor-binding internalized ligand and nucleic acid binding domain in the ifiision 
protein may be reversed. If the nucleic acid binding domain is N-terminal, then it is 
modified to remove the stop codon and any stop signals. 

If the HBEGF or other ligand has been modified so as to lack mitogenic 

30 activity or other biological activities, binding and internalization may still be readily 
assayed by any one of the follovwng tests or other equivalent tests. Generally, these 
tests involve labeling the ligand, incubating it with target cells, and visualizing or 
measuring intracellular label. For example, briefly, HBEGF may be fiuorescehtiy 
labeled vnxh FITC or radiolabeled with 525] Fluorescein-conjugated HBEGF is 

35 incubated with cells and examined microscopically by fluorescence microscopy or 
confocal microscopy for internalization, When HBEGF is labeled with 125i^ the 
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, labeled HBEGF is incubated with cells at ^''C. Cells are temperature shifted to 37°C 
. aj?d washed with 2 M. NaCl at low pH to remove aiiy cell-bound HBEGF. Label is then 
counted and thereby measuring initemalization of HBEGp; Alternatively, the ligand can 
- , be. conjugated with ^an nucleic acid binding domaiin by any of the methods described 
5 herein and complexed with a plasmid encoding Mporinr below, the 

complex nMy.,l)eiiised!^.tranSfect cells ^a^^ 
; • 1 ... .jJhe DNA encOdihg the resulting receptor-ttnding internalized ligand — 
. nucle^acidbmdtng domain ban'be ihserfe^ expressed in a selected 

• I?.P5t, .as desQrjbedr^ibove; to produce a mc^^ 

,. .. ; : - . f- i Multiple-^ c6pies5'''6f tfife' modified ' rec6ptor-binding internalized 
. ligand/nucleicLacid .binding d6main' bKiinc!ra 'can be inserted into a single; plasmid in 
operative linkage with one promoter. ^Tien expressed, the resulting protein will be a 
.multimer, Typically .two> to. six copies of tHe'fchimera ^e insCTted, preferably in ,a head 
to tail fashion, into; one plasmid.' . Merely by way of exariiplfei DN A encoding human 
15 bFGFr has been mutagenized using *^iplicing by ova-lip exteiwion (SOE). . Each 

application of the SOE method uses two ainplified oligonucleoti^^ 
, : haye.complenientary ends as primers aiid which include an' altei^d codpii at the locus at 
yr:.u.^^^<^^ *e mutation is desired, to prddiifce a- hybnd'p^^^ 

;. v; reaction that .uses two primers thaf anneal^^-^t th^" noh-oye^^^^^ the 
20, ... hybrid to produce' DNA that has the de^^^ "' - ' • ' " ' 

. -n. i> .The receptor-binding internalized ligahd/nueleic acid binding domain is 
. incubated with; the cytocide^ncoilirig iagent, typically a DNA molecule, to be;,delivered 
^der .conditipns that allow; binding of the hucleic acid binding domain ito the . agent. 
.. ^Conditions wl|..yaiy sbmewhat' depending^ on the natiure of" the nucleic acid, binding 
2^ , domain,, but will typically occur- in 0;IM N^tl and 20 mM rtEPES or o^ 
.. , bjuffer. , •-!:;...,! •■■ -• ■' ■ .■>■'■ 

■ -. The desired application is the delivery of cytotocidal agentsj such as 
saporin, in a non-toxic formi By. delivering 'a nucleic acid molecule capable of 
expressing saporin, ithe timing of cytotbxicity may be exquisitely controlled. For , 
^® ! :i*^''??"P'^' ^*P°^" »s expressed under the control of a tissue-specific prompter- then 
: : PP^^ of the complex by cells having the tissue-spedfic fabtors necessary, for promoter 
. aqtivation will result in the killing of those cells. On the other hand, if cells' taking up 
the complex do not have those tissue-specific factors, the cells will be spared. 

• , ^ . Merely by way of example, test cbhstriicts have been made and tested. • 
35 . One. construct is a chemical conjugate of bFGF and poly-L-lysine. The bFGF molecule 
, is a variant in which' the Cys residue at position '96 his bfeeri changed to a serine: thus. 
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only the Cys at position 78 is available for conjugation. This bFGF is called FGF2-3. 
The poly-L-lysine was derivatized with SPDP and coupled to FGF2-3. This FGF2- 
3/poly-L-lysine conjugate was used to deliver a plasmid able to express the |J- 
galactosidase gene. ' 
5 The ability of a construct to bind nucleic acid molecules may be 

conveniently assessed by agarose gel electrophoresis. Briefly^ 9 plasmid, such as pSVp, 
is digested with restriction enzymes to yield a variety of fragment sizes. For ease of 
detection, the fragments may be labeled with either by filling in of the ends with 
DNA polymerase I or by phosphorylation of the 5Vend with polynucleotide kinase 

10 following dephosphorylation by alkaline phosphatase. The plasmid fragments are then 
incubated with the receptor-binding mtemalized Jigand/nucleic acid binding domain in 
this case, FGF2-3/poly-L'lysine in a buffered saline solution, such asl 20 mM HEPES, 
pH 7.3, O.IM NaCl. The reaction mixture, is. electrpphoresed on an agarose gel 
alongside similarly digested, but nonreacted fragments. If a radioactive label was 

15 incorp>orated, the gel may be dried and autor^dipgraphed. If no radioactive label is 
present, the gel may be stained with ethidium bromide and the DNA visualized through 
appropriate red filters after excitation with UV. Binding has occurred if the mobility of 
the fragments is retarded compared to the control. In the example case, the mobility of 
the fragments was retarded after binding with the FGF2-3/poly-L-lysine conjugate. 

20 Further testing of the conjugate is j>e;rformed to show that it binds to the 

cell surface receptor and is internalized into the cell. It is not necessary that the 
receptor-binding internalized ligand part of the conjugate retain complete biological 
activity. For example, HBEGF is niitogenic on certain cell types. .As discussed above, 
this activity may not always be desirable. If this activity is present, a proliferation assay 

25 is performed. Likewise, for each desirable activity, an appropriate assay may be 
performed. However, for application of the subject invention, the only criteria th^t need 
be met are receptor binding and internalization. 

Receptor binding and internalization may be measured by the following 
three assays. (1) A competitive inhibition assay of the complex to cells expressing the 

30 appropriate receptor demonstrates receptor binding. (2) Receptor binding and 
internalization may be assayed by measuring P-gal expression (e.g., enzymatic activity) 
in cells that have been transformed with a complex of a (i-gal containing plasmid 
condensed with a receptor-binding internalized ligand/nucleic acid, binding dommn. 
This assay is particularly useful for optimizing conditions to give maximal 

35 transformation. Thus, the optimum ratio of receptor-binding internalized ligand/nucleic 
acid binding domain to nucleic acid and the amount of DNA per cell may readily be 
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determined by assaying and comparing the enzymatic activity orp-gal. As such, these 
. ^ first two assays are useful for prelimitiary analysis and failure to show receptor binding 
.: . or IJ-gal activity does, not per se eliminate a candidate rfeceiptor-binditig internalized 
ligand/nucleic acid binding domain conjugate or fusion protein from furtHer anal^ 
. 5.-, ,(3>Thg::,pr^ferT^^ atcytdtoxicity assay performed on cells transformed with a 

' cytpciderenepding agentKbound - by/ r^^^ ligand/nucleic acid 

binding :4omaiii/uWhiJe/ in rgen used," nix)some- 

. :inactivatingiproteins -are preferred and ssapbrih, or another type f ribosortie-in^^^^ 
. protein, !? particularly preferred, - A statistically 'significant reduction in ceil humber 
10 .demonstrates the ability of the receptor-binding ihtemalized ligand^^ 
; domain conjugate , or ;fusi6n to deiiverttud^^ . cJ. . 

• ;C.^:r Other elements- ^"^ '-^-^'-'O-^ v-T^ - — ■ 
, ^ - 1. Nuclear traiisloibatjdn » ' 
' As used Hereih^k huclear^transloca^^ 
15 sequence of amino acids in a protein that are required for translocation of the protein 
into a^ ceil nucleus. ExampJes 6f NTS are set forth in table 2, biejow. Comparison with 
known NTSs, and if riecessiary testing of can^^ sequences, should per^^ those of 
' skill^ih the art to ireadily identify otfter amino aa^^ ftinction as NT^s. 

: • = As usfed herein, HeidMicigb^ from 
20 o the NTS th^t dcciirs m the' wild^tVp^]^ polyp^pticje^ or Fo^ example, the 

NTS^may bb derivbd from 'another poly^^ may be syi=ithesi2^dror i be 

, derived fr6m' another re^^ polypeptide. ^ A typical* consensus NTS 

- ' • 'sequence contains aji aminb-t^iTniHal proiine or glycine followed by at least thre^ basic 
* residues in a -arfay of sie to mhe amino icids (s'ee. e g. Dang et al! (1989) J. Biol 

-25 G/iem. 26^fl80lW8023; Danj^'et aL X1988) A/k Cell kol 8:AOA9-405^^ 2, 

- ' 'Which sets forth examjf)l& of 'NTSk share hornology with 

known NTSs), '''' ' •'' • '''' ' ■ ' " " " ' ' 



TABLE 2* 



. Source^ . . 


' -c:. - .'I Sequence - • ■ * - ' 


SEQ ID 
NO 


'~'Sy40 latge T 


126 , 

Pro LysLysArgLysValGlu 


67 


Polyoma large. T 


' ' 279 

: • ' Pro ProLysLysAlaArgGluVal 


68 
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Human c-Myc 


120 

Pro AlaAlaLysArgValLysLeuAsp 


69 


Adenovirus El A 


281 

Lys ArgProArgPro 


70 


Yeast mat a2 


Lys IleProIIeLys 


71 


c-Erb-A 


1: ■ 

A. Gly LysArgLysArgLysSer 

127 

B. Ser LysArgValAlaLysArgLysIeu 

C. Scr HisTrpLysGlnLysArgLysPhe 


72 

73 
74. 


c-Myb 


Pro LeuLeuLysLysIleLysGln 


75 


p53 


Pro GlnProLysLysLysPro 


76 


Nucleolin 


277 

Pro GlyLysArgLysLysGluMetThrLysGlnLysGluValPro 


77 


HIV Tat 


Gly ArgLysLysArgArgGlnArgArgArgAlaPro 


78 


FGF-1 


: . AsnTyrLysLysProLysLeu 


79 


FGF^2 


HisfheLysAspProLysArg: 


.80 


FGF-3 


AlaProArgArgArgLysLcu 


63 


FGF-4 


IleLysArgLeiiArgArg 


- 


FGF.5 


GlyArgArg 




FGF-6 


IleLysArgGlnArgArg ■ 




FGF.7 


lleArgValArgArg 


65 


VEGF189 


LysArgLysArgLysLys (in EXON VI) 


66 
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• VEGF206 


LysAxgLysArgLysLys (in EXON VI) ; 


66 


PDGF 


ProLysGIyLysHisArgLysPheLysHisThi ' ; 


> 



< ! 



.*:$uperscript.indicates position in protein 



5 



2. -'-Cytoplasm^trahslocaiion sigttWr' 

^^''P^-^™"^^^'®*^*'^" >quence is a sequence of amino acids in a 

protein that cause retention of proteins in the lumen of the endoplasmic reticulum 
, and/or translocate proteins to the cyt6soK._The signal sequence in mammalian cells is 
. KDEL (Lys-Asp-Glu-Leu) (Mu^^^^^ Cell -/*:899.907,. 1987). =Some 

i modifications pf this.sequehce:have-been-made vithout loss of activit^^^^ 
10 the sequences.RPEL (Arg-Asp-GIu-Leu) and KEEL (Lys-GJu-blu-Leu) confer efficient 
or partial retemion, respectively; in-plants (Deriecke ei alV/ fmfeo "7 7/-2345.2355 

r- -J -A- cytoplasm-traMSlcication-sighaa sbqu<»ice "nray "be' deluded ii^ ^rin' or for 
. ; V conjugates of HBEGF -with-a nueleic^acid binding domain, the sequehce may reside in 
j l 5 -either part -or both. If cleavable linkers' a^e used in the cdnjugate, . the cytoplasm- 
' . _;_;«ra!»focation_signal-- is Preferably included in sapprin^ec Jlift..nucieic.''iid.. binding 
• , Additionally, a c^oplasmic-translocation signal sequence may ;be included in 

I . J^BEGF, as long as it is placed so as not tP.interfere with receptor binding. ' 

. In addition, or alternatively, mcmbrane^isruptive peptides may be incorporated 
20 mtp complexes of HBE.GF.nucleic acid-binding domain and cytocide-encoding agent. 
Adenoviruses are known to enhance, disruption of endosomes. Virus-free viral proteins. 
_ s.uch,as.influenza -virus hemagglutinin HA^2, may be useful in the-present invention' 
Other proteins may be-tested in the assays described herein to find spec^ific endosome 
disrupting agents that enhance gene deliveo-. In general, these proteins and peptides are 
25 amphipathic {see, Wagner et s\i, Adv. Drug. Dei Rev. 14:\ 13-13'5, 1994).' • 
3. Linkers 

_ A 'S a peptide or other, molecule _that couples a HBEGF 

polypeptide to the targeted agent. The linker may be bound via the N- or C-terminus or 
an internal reside, but, typically within about 20 amino acids of either terminus of a 
30 HBEGF and/or targeted agent. The linkers provided herein increase intracellular 
availability, serum stability, specificity and solubility of the conjugate or provide 
increased fiexibility or relieve steric hindrance in the conjugate. For example. 
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specificity or intracellular availability of the targeted agent may be conferred by 
including a linker that is a substrate for certain proteases, such as a protease that is 
present in only certain subcellular compartments or that is present at higher levels in 
tumor cells than normal cells. 
5 In order to increase the serum stability, solubility and/or intracellular 

concentration and to reduce steric hindrance caused by close proximity of HBEGF and 
the targeted agent, one or more linkers is(are) inserted between the HBEGF protein and 
the targeted moiety. These linkers include peptide linkers, such as intracellular protease 
substrates and peptides that increase flexibiiity or solubility of the linked moieties, and 

10 chemical linkers, such as acid labile linkers, ribozyme substrate linkers and others. 
Peptide linkers may be inserted using heterobiofimctional reagents, described below, or, 
preferably, are linked to HBEGF by linking DNA encoding the substiate to the DNA 
encoding the HBEGF protein and expressing the resulting chimera. In instances in 
which the targeted agent is a protein, such as a RIP, the DNA encoding the linker can be 

15 inserted between the DNA encoding the HBEGF protein and the DNA encoding the 
targeted protein agent. 

Chemical linkers may be inserted by covalently coupling the linker to the 
HBEGF protein and the targeted agent. The heterobiftinctional agents, described below, 
may be used to effect such covalent coupling: 

20 a* Protease substrates 

Peptides encoding protease-specific substrates 2ire introduced between 
the HBEGF protein and the targeted moiety. The peptides may be inserted using 
heterobiofunctional reagents, described below, or, preferably, are linked to HBEGF by 
linking DNA encoding the substrate to the DNA encoding the HBEGF ^protein and 

25 expressing the resulting chimera. In mstahces in which the targeted agent is a protein, 
such as a RIP, the DNA encoding the linker can be inserted between the DNA encoding 
the HBEGF protein and the DNA encoding the targeted protein agent. For example. 
DNA encoding substrates s[>ecific for inmiceliular proteases has been inserted between 
the DNA encoding the HBEGF protein and a targeted agent, such as saporin. 

30 Any protease specific substrate (see, e.g., O'Hare et al. (1990) FEES 

273:?00-204; Forsbcrg et al. (1991) J. Protein Chem. 70:517-526; Westby et al. (1992) 
Bioconjuugate Chem, 5:375-381) may be introduced as a linker between the HBEGF 
p)olypept)de and linked targeting agent as long as the substrate is cleaved in an 
intracellular compartment.. Preferred . substrates include those that are specific for 

35 proteases that are, expressed at higher levels in tumor cells or that are preferentially 
expressed in the endosome. The following substrates are among those contemplated 
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fo^use ,„ accord ^v.th the methods herein: c^^^^^^^ 

• TT"^1^ substrate-ana " recombinant ^ subtilisin substrate' 

.(XaaAspGluLeu.SEQ ID NO. 50, particularly, PheAlaHistyr, SEQ ID NO. 49).' 

b. Flexible linkers and linkers that increase the solubilii, of the 

■•.C'.'.-.n'^ (i;!rc:-.Mo.*pnji|-gates!^-,.. - f.r^^ ^ •• 

.. .co^templaM .f:qr.^se.either alone or..with other linkers; such as the pfote^ specific 
subst^eliiikei?, Such Mrikers include.' butare hot limited to.- (Gly4Sef)n (Ser4Gly)n 



15 



: '■ ■ . r r/iXW : Gly4Ser SE0 ID-NO -40 ' ' ' • •• - ' • ■ ' ■'' 

CCATGGGCGG.CGGGGGGTCi: 6CCATGG. vc :• • . : ; 

; . . . •<2). "(Gly4Ser)2 SEQ -ID NO 41 ' - - ' ' ' 
CCATGGGCGG CGGCGGC.TCT: GGCGGCGGCG GCTCTGCCAT G6 ^ ' -^^ ■ ' ' ' • - • ' 
• • - • -. ; • . . (3), =-,.<Ser4Gly)4 SEQ-ID-NO 42 ' ' ^ 



•vch'.. .c= .:..;r.-.- -.- (4.). . rtX'SeP4Gny)2 SEQ- I-D'- NO' -AJ- ' ' ' 

CCATGGCCTC GTCGTCGTCG GGCTCGTCGJ/ CGTCGGGCGC tATGG ^ ^ - ' ^ ' ••'"'■^ 

^.f^-.,^E9^^I^.Noj48...'''''';^^^^^^ ^^^^^-^^^ 

: ! ' . ; vC. . K Heterobifunctional cross-linking reagents 
. ^ N'^werousAeterobifunctionaK.^^^^^ 

poyal.e«t..bpnd9,bsS«^,a»mo -groups-tod' thiOl groups and to introduce thior groups 
25. mto protems. are kno^vt,^to. those of skill in this' art fe. e.^, the PIERCE cAtaLOG- 
-IiT«i'^oTeclmolog.y..Catalog &.Handb^^^^^^^ which describes 0,e prep^tion 

, . . of .and .use of,such.|eagents and provides^a co^^ 

aIso, .e.^.,,CMmber,<j, a], (1992) Bioconjugaie Chem. 3397-461:7^^ ti al (1987)' 

.0 w T •« ^- (1987) Proc. NafL Scl *V:308-312- 

30...Walden:et al. (.198.6)^ . W. .C^^^ 

: ;7 75: 723,737; Mahan et al. 91987).^na/. fi,oc/,^;„. 762:163-170; Wawryzhaczak et 
al ^(1992).,^r. ,/:G««c.r.(5(5:361-366: Fattom et al. (1992) Jnfection & Jmmun. 60:5S4- 
. 589). , These reagents may , be used to form covalent bonds between Oie HBEGF 
polyp.ept,de(s) with protease substrate peptide linkers and t^geted^rotein agent. These 

' ^^ "^^ '^-^ N-^"ccihimidyl-3^(2-pyridyldithio)propionate 

. (SPDP; disulfide.: : linker);. sulfosuccinimidyl 6-[3-<;2.pyridyldithio)pri>pon- 
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amido]hexanoate (sulfo-LC-SPDP); succinimidyloxycarbonyl-a-methyl benzyl 
thiosulfate (SMBT, hindered disu?fate linker); succinimidyl 6-[3-(2-pyridyldithio) 
propionamidojhexanoate (LC-SPDP); sulfosuccinimidyl 4-(N- 

malcimidomcthyl)cyclohexane- 1 -carboxy late (siilfo-SMCC); succinimidyl 3-(2- 

5 pyridyldithio)butyrate (SPDB; hindered disulfide bond linker); sulfosuccinimidyl 2-(7- 
azido-4.methylcoumarin-3-acetamide) ethyl -l,3*-dithiopropionate (SAED); sulfo- 
succinimidyl 7-azido-4-mcthylcoumarin-3-acetate (SAMCA); sulfosuccinimidyl 6- 
[alpha-methyl-alpha-(2-pyridyldithio)toliiamido]hex (sulfo-LC-SMPT); 1 ,4-di- 

[3 -(2 -pyridyldithio)propionamido]butane (DPDPB); 4-succimmidyloxycarbonyl-a- 

10 methyl-a-(2-pyridylthio)toluene (SMPT^ hindered disulfate linker);siflfosuccinimidyl6[ 
a-methyl-<x-(2-pyridyldithio)toluamido]hexanoate (sulfo-LC-SMPT); m- 

malcimidobcnzoyl-N-hydroxysuccinimide- ester (MBS); m-maleimidobenzoyl-N- 
hydroxysulfosuccinimide ester. (sulfo-MBS); N-succinimidyl(4- 

iodoacetyl)aminobenzoate (SIAB; thioethcr linker); sulfosuccinimidyl(4- 

15 iodoacetyOamino benzoate (sulfo-SIAB); 5uccinimidyl4(p-maleimidophenyl)butyrate 
(SMPB); sulfosuccinimidyl4-(p-roaleimidophcnyl)butyrate (sulfo-SMPB); 

azidobenzoyl hydrazide (ABH). These linkers should be particularly useful when used 
in combination with peptide linkers,: such as those that increase flexibility. 

d. Acid cleavabie^ photocleavable and heat sensitive linkers 

20 Acid cleavable linkers include, but are hot limited to, 

bismaleimideothoxy propane; and adipic acid dihydrazide hnkers {see. e.g., Fattom et 
al. (1992) Infection & Immun. 60:584-589) and acid labile Uansferrin conjugates that 
contain a sufficient portion of transferrin to permit entry into the intracellular transferrin 
cycling pathway (see. e.g., Welhoner et al. (1991) J: BioL Chem, 266:4309-4314). 

25 Conjugates liidced via acid cleavable linkers should be preferentially cleaved in acidic 
intracellular compartments, sudi as the endosome. 

Photocleavable linkers are linkers that are cleaved iipon exposure to light 
{see, e.g., Goldmacher et al, (1992) Bioconj, Chem, 5:104-107, which linkers are herein 
incorporated by reference), thereby releasing the tai^eted agent upon exposure to light. 

30 Photocleavable linkers that are cleaved upon exposure to light arc known {see. e.g., 
Hazum et al. (1981) in Pept.. Proc. Eur. Pept. Symp., I6th, Brunfeldt, K (Ed), pp. 105- 
110, which describes the use of a nitrobenzyl group as a photocleavable protective 
group for cysteine; Yen et al. (1989) MakromoL Chem. 790:69-82. which describes 
water soluble photocleavable copolymers, including hydroxypropylmethacrylamide 

35 copolymer, glycine copolymer, fluorescein copolymer and methylriiodamine 
copolymer, Goldmacher et al. {i992y Bioconj. Chem. 5:104-107, which describes a 
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^/cross-linker and reagent that undergoes pHotolytic degradation upon exposure to near 
UV; light (350 nm); and Senter et al (19S5) Photochem. Photobiol 42:231-237, which 
describes: nitrobenzyloxycarbonyl chloride cross lirilcing reagents that produce 
photocleavable linkages)^ thereby releasing the targeted agent upon exposure to light. 

5 Such . ilinJecreilwould dennatologicial or ^ 'ophthalmic 

coiiditionOs ithatj can be ^exposed to light using fiber optics; Aft^f iadmimstration of the 
iconjugatev'the eye or skin bt other bddy piiit-c'ari bte'exposfed to light, resulting in release 

. . of the tar^etea If the toxic mdiety is 'a iight activated 

^porphyrin, light-exposure will alisto actiVjate the* p6rf)h>1iri, there^ cell death. 

,10 Use: of photocleavable linkers shotfld perinii^ddiihistra^ of higher dosages of such 

conjugatesxornpared to conjugates thdt^ refease a cytotoxic agent UF>6n ihteriialization. 

Heat sensitive linkers wbuld also have siiiriilar^'^^ 

' ■ Expression vectors' and host cell^ for expression of HBEGF or targeted 

^ ■ . ■■•'^ , . agents, .; mi. ■ C^'-' • • • 

15/, • : : . - As used herein, vector or plasmid refers to discrete elements that are 

used to introduce heterologous DN A into cells for either expression of the heterologous 

/ M :PNA or for replication of the cloned heterologous DN A. Selection and use of such 

vectors and plasmids are. well within the level- b^^^^^^ Expression refers to 

the /process . by , \vhicb Aueleic . acid is jtranscribed'^^ihto ^ mRNA' and translated into 

20 ipeptides, polypeptides, or proteins. - If the^nuCleic aiid is) derived from genomic DNX, 

expression may, if an appropriate; eukao^otlC;' host icellvo 

.,spHping.of4hejT4tNA. ^ , ; -t ^-- iWu'"; iJ- 

w ' - ^. As used^herein^ expre^ 

DN A ^fragments AaV' are; inr operative linkage i with regulatory sequences,- such as 

25. promoter, regions, that .are capable of effecting expression of such DNA r ftagments. 

Thus, an expression vector refers to a recombinant DN A or RN A constnict, such as a 

plasmid,. a phages recombinant; virus , or other vector that, upon introduction into an 

. . . .appropriate host cell, results in expression of the cloned.DNA. Appropriate expression 

.^vectors are well knowi to those .of skill inthe art and include those that are replicable in 

30 , eukaryotic. cells aind/or prokaryotic. cells and. those that remain^ episomal or may 

.: - integrate into the host cell genome. v " - 

As . used , herein, operative linkage or operative i association of 

heterologous DNA . to regulatory and effector sequences of nucleotides, such as 

.promoters, enhancers, .transcriptional and /translational stop sites,, and other signal 

35 sequences, refers to the ftmctional relationship between such DNA* and such sequences 

,, of , nucleotides. 7 For. example, operative linka^^ of heterologous DNA :to a promoter 
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refers to the physical and functional relationship between the DNA and the promoter 
such that the transcription of such DNA is initiated from the promoter by an RNA 
polymerase that specifically recognizes, binds to and transcribes the DNA in reading 
frame. For example, the NTS may be derived from another polypeptide, it may be 
5 synthesized, or it may be derived from another region in the same polypeptide. 

As used herein, transfection refers to the taking up of DNA or RNA by a 
host cell. Transfonriation refers, to this process performed in a manner such that the 
DNA is replicable, either as an extrachromosomal element or as part of the 
chromosomal DNA of the host. Methods and means for effecting transfection and 

10 transformation are well known to those of skill in this art (see, e.g.. Wigler et al. (1979) 
Proc. Natl. Acad ScL USA 7^:1373-1376; Cohen et al. (1972) Proc, Natl Acad. Sci. 

USA 69:2\\Q), 

DNA encoding the selected HBEGF or a portion thereof, HBEGF 
conjugate or polypeptide targeted agent is inserted into a suitable vector and expressed 

15 in a suitable prokaryotic or eukaryotic host. Ntimerous suitable hosts and vectors are 
known and available to those of skill in this art and may be purchased conunercially or 
constructed according to published protocols using well known and available starting 
materials. Suitable eukaryotic host cells include insect cells, yeast cells, and animal 
cells. Insect cells and bicterial host cells are presently preferred. Suitable prokaryotic 

20 host cells include £. co//, strains of Bacillus and Streptomyces. 

The plasmids used herein must include a promoter in operable 
association with the DNA encoding the protein or polypeptide of interest and are 
designed for expression of proteins in a bacterial host. A promoter region refers to the 
portion of DNA of a gene that controls transcription of DNA to which it is bperatively 

25 linked. A portion of the promoter region includes specific sequences of DNA that are 
sufficient for RNA polymerase recognition, binding and transcription initiation. 
Promoters, depending upon the nature of the regulation, may be constitutive or 
regulated. For use herein, inducible promoters are preferred. The promoters are 
recognized by an RNA polymerase that is expressed by the host. The RNA polymerase 

30 may be endogenous to the host or may be introduced by genetic engineering into the 
host, either as part of the host chromosome or on an episomal element, including a 
plasmid containing the DNA encoding the saporin-containing polypeptide. Most 
preferred promoters for use herein are tightly regulated such that, absent induction, the 
DNA encoding the saporin-containing protein is not expressed. It has been foimd that 

35 tightly regulatable promoters are preferred, for expression of saporin. Suitable 
prompters for expression of proteins and polypeptides herein are widely available and 
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, are^^yeIl known in the art. For expression of the proteins such promoters are inserted in 
.. a plasmid m operative linkage with a cbntroJ region such ak the lac opdrbn. Prefeired 
. . , promot^ regions are..those that are inducible and functidhkl in £ co// or ^ly genes in 
. ^. yectors.ofviral.,r:gin. Examples df suitable indtic 

5 mclijde, ,M..are.wt. limited to: .the-^i>cb/; I* <^r responsive to isopropyf B 
. . .-P:AiogalaQtopyi3noSide(IPTG;see.reta: N^ 75. 1 109-11 17)- 

iho, metallothionein promoter metalcreguIat&ry-elem^ntsHesporisive to heaVy-metal' 
, M.g., zinc) mductipn.(^.e:.e:^, -U.S. Patent No. 4,870.009 to EvaiisV/ a/.); the phage 
,n ^^^""^^ '"^^^^-^ IPTG (see, 'e.^,' U.S. Patent No. 4,952,496; knd* sWdier et 
10 .1. (1990).^../,. En^ol: /55.-60-89) ahd the W prbmbW. Other' pkimoters include 
but^are not limited to. the'T7 phage promoter 

^ the . T5 and SP6 promoters, the trp, Ipp, and lac promoters, siith as the' lacUV5 
. ; .from;£.. cph; the^PlO or polyhedrin- gene promoter of baculovi'rui/insect cell expression 
f,^?ilr •^•^••■•^••S-P««>fNos..5^43,04K 5^4i,5687/5;266;3lX 4>45:fe,; and 
15 . 5,1.69,784) and mducible promoters from other eukaryofic^^xp^^^^^ 

The DNA construct is introduced into a plasmid suitable for expression 
.. , m the selected host. .The sequences of nucleotides in the pii^ids that are ^^latory 
p'ons, such as prom<.iers and oper^drs;^ opeiWidn^liy a^s6^i^e<l^ 
, ,.for;transc.npt.on. T3ie sequence. <jf Nucleotides^ ^^it^^^^^^^ 
20 or cytotoxic agem may also include DNA encbdihg a'^sedr^tiiin^y^^ the 
, • resultmg pept.de .is a precursor pTotein; Secretion si^rials suitab^ for use are widely 
?va,lable..andtare.!w^ll known in the art: Secretion signal refers "to a peptide region 
withm the P.repurspi:,pmtein. that ^directs . secretion of precursor protein from the 
. , cyt9plasm,,of.,the. host :««it«..the..-peripltohi-c -^e- or into the extiacellulW growth 
25 medium,,. Such^signals may be/either at the amino terminus or carboxyl tennifiiis of the 
. ..precursor protein. .The. preferred secretion signal 4s linked tothe amino terminus and 
ipay be heterologous to. the protein to which it is'linked. Prdkaiyotic arid euka^votic 
, sepr^tion^signals functional in to/r,-may be employed. The presently preferred 
secretion signals include, but are not limited to, those encoded by the following E coli 
30.. genes: ,.,ompA,: OmpT,.ompF. ..ompC, beta-lactamase, pelB and bacterial alkaline 
phosphatase, and theJike (von Heijne (19»5) J Mol. Biol. 184:99-\0S). In addition the 
bactenal pelB geneseeretion signal (Lei etal. (1987);/. fiflc/er/o/. 7tf^^^^^ 
secretion, sig.nal,,and the. cek2 secretion signal, functional in iniect cells, may be 
employed, , The .most preferred secretion signal for bacterial expression is the £ coli 
35 ompA secretion signal. . For eukary^otic expression systems, partii^Wlarly insect cell 
.^systems, the sigpals. from secretednproteins. such as insulin, groWth hormone: mellitin. 
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and mammalian alkaline phosphatase are of interest herein. Other prokaryotic and 
eukaryotic secretion signals known to those of skill in the art may also be employed 
(see, e.g., von Heijne (1985) J. Mol Biol 184:99A0Sy Using the methods described 
herein, one of skill in the art can substitute 3ecretiQn signals that are functional in either 
S yeast, insect or mammalian cells to secrete, the. heterologous protein from those cells. 
The resulting processed protein may be recovered from the periplasmic space or the 
fermentation medium or growth medium. ^ > 

The plasmids may also include a selectable marker gene or genes that are 
functional in the host. A selectable marker gene includes any gene that Confers a 

10 phenotype on bacteria that allows transfomied bacterial cells to be identified and 
selectively grown from among a vast majority of untransformed cells. Suitable 
selectable marker genes for bacterial hosts, , for example, include the ampiciilin 
resistance gene (Amp^), tetracycline resistance gene (Tc^) and the kanamycin resistance 
gene (Kan^). The kanamycin resistance gene is presently preferred. 

15 Particularly preferred plasmids for transformation of E. coli cells include 

the pET expression vectors (see, U.S patent 4,952,496; available from Novagen, 
Madison, WI; see, also literature published by Novagen describing the system). Such 
plasmids include pET 11a, which contains the T71ac promoter, T7 temiiriator, the 
inducible E, coli lac pp^ratpr, and .the lac repressor gene; pET 12a-C4 which contains the 

20 T7 promoter, T7 terminator,. .and -tthe, £. ,co// ompT secretion signal; and pET 15b 
(Novagen, Madison, WI)^ which contains a His-Tag^^ leader sequence (Seq.- ID NO. 
23) for use in purificat.ion with a ^s column and a thrombin cleavage site that permits 
cleavage following purification over the colunm; the T7-lac promoter region and the T7 
terminator. 

25 Other preferred plasmids include the pKK plasmids, particularly pKK 

223-3, which contains the,TA<^ promoter, (available from Pharmacia; see also, Brosius 
et al. (1984) Proc. Natl. Acad Sci. 81:6929; Ausubel et al.. Current Protocols in 
Molecular Biology; U.S. Patent Nos. 5.122,463, 5,173,403, 5,187,153, 5,204,254, 
5,212,058, 5,212,286, 5,215,907, 5,220,013, 5,223,483, and 5^29,279). Plasmid pKK 

30 has been modified by insertion of a kanamycin resistance cassette with EcoRl sticky 
ends (purchased from Pharmacia; obtained from pUC4K, see, e.g,, Vieira et'al..(1982) 
Gene / 9:259-268; and U.S. Patent No. 4,719,179) into the ampiciilin resistance marker 
gene. 

Other preferred vectors include the pPL-lambda inducible expression 
35 vector, pTrc99A, and the tac promoter vector pDR450 (see, e.g. U.S. Patent Nos. 
5,281,525, 5^62309, 5,240,831, 5,231,008, 5.227,469, 5,227,293, ; available from 
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. Ph,armac,a P.L.Biochemicals, see; also Mott/etal. (1985) P^^^^^^ 5c/ USA 

«!?2;88; and De Boeret al. (1983) Proc. Natl. Acad Sci. 'aS.A. 80:21); baculovims 

.vectors,,such,.as a.pBlueBac vector (als6 cilled pJVETL and denvatives thereof see 

!tv!:;^" patent .Nosc..- 5,278 :050, 5,244,805. 5;243,04ir ^5^42.687; '5^66317; 
. 4,745,05 ^.Md 5.11 69,.784>^^ HI - \ . . ■ • . : 

' ' : <^eriplasmids inchJde the pW-IIIompA plasmids (see, U.S. Patent No 

4,575.013 to Inouye; see, also, Duffaud'et-al. (1987) A/e/A. £«z. /5J. 492-507) such as 

pWlIompA2 . The. pIN-IIIompA'plakniids incltide an irf^rtion site for heterologous 

„ PNA linked in^transcriptiohal reading fianie with fundiond figments deriveii from the 

hpoprptein gene ;of £ co//...-Th^^Jla^ids'also include a DNX fiagment coding for the 

; signal., peptide- of the ompA rproteih 6f E. cb//, positfoned sich tKat the desired 

polypeptide is expressed with the 6nipA «gnal 'peptide ^t its kinino teWinus, thereby 

allowing efficient secretion acJmstf tte'«^asmic-,^cmbriifae: ' -Tte' ji&iids 'farther 

include DNA encoding a specific segment of th^ £. coli l4^ ^^moter^^^t6r, which is 

positioned in the proper . orientation T6r-trahscrip^^ of the desired 

polypeptide, as well as a s^arate functional £^r^// lacl gene eiici>ding the associated 

repressor .molecule that, in thoabsfence of Kc op&on li^uiid^^interacts with the lac 

prommer-operator to prevent transcription therefroni. icpi4si6ii^ ^ th^ desired 

polypeptide is urider. the conu-ol>.of ^the K^>6prbtfcirr tijip^^^ ,3e 

ipromoter-operator, although transcrtptioH frcim 'hither pMb^^ 2 nc^^^ blocked !^ 

■■ : ;the repressor molecule..The repressor is selectively iriadi^^td bf^ of an ihducer 

. . inolecule thereby inducing .transcriptionaT fej^re^^ f^^^ 
both promoters. vi **;' v/. ;;t j. ■ i; * . - .-wx-.:.: \ v ...--i f s 

- . repressor protein may be encoded by the plasmid containing the 

25 cpnstruct.or a second plasmid thit contains a gene eilboding for a repressor-protein. 
... The, repressor-protein is capable of repres^in^ the tiankcfiptioii' of a promoter that 
: contains sequences of nucleotides to which the repressor-protdn binds: TTie promoter 
can. be derepressed > by- altering the physiological conditions of the cell. The alteration 
. can. be accomplished'by the addition' to the' growth medium of a moleciil^ that inhibits 
30. for example, the ability to interact with the operator oF with regulatory' proteins or other" 
J . regions of the DNA or by altering the temperature of the ^groWth media. Preferred 
. repressor-proteins include, but are not limited to the £ coli laci repressor responsive to 
IPTG induction, the temperature sensitive cI857 repressor, and the like. The £. coli lad 
repressor is preferred. : . '\ _ ; . • . ■ . . 

■ ' ^" <=ertain preferred embodiments, the constmcts also include a' 

. transcription terminator sequence. A transcription itemiinator region has either (a) a 
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subsegment that encodes a polyadenylation signal and polyadenylation site in the 
transcript, and/or (b) a subsegment that provides a transcription termination signal that 
terminates transcription by the polymerase that recognizes the selected promoter. The 
entire transcription terminator may be obtained from a protein*encoding gene, which 
5 may be the same or different from the gene, which is -the source of the promoter. 
Preferred transcription terminator, regions are those that; .are; functional in' Ei coli. 
Transcription terminators are optional components of the expression systems herein, but 
are employed in preferred embodiments. The promoter regions and transcription 
terminators are each independently selected from the same or difiimnt genes. In some 

10 embodiments, the .DNA fragment is replicated in bacterial cells, preferably in E. colL 
The DNA fragment also typically includes a bacterial origin of replicatibti, to ensure the 
maintenance of the DNA fragment from generation to generation of the bacteria. In this 
way, large quantities of the DNA fi-agment can be produced by repHcation in bacteria. 
Preferred bacterial origins of replication include,, but are not limited to, the fl-ori and 

15 col El origins of replication. 

Preferred bacterial hosts contain chromosomal copies of DNA encoding 
T7 RNA polymerase operably linked to an inducible promoter, such as the iacUV 
promoter (see, U.S. Patent No. 4,952,496). Such hosts include, but are not limited to, 
' lysogenic E cqU strains HMS174(DE3)pLysS, BL21(DE3)pLysS, HMS174(DE3) and 

20 BL21(DE3'). Strain BL2 J (pE3) is preferred The pLys strains provide low levels of T7 
lysozyme, a natural inhibitor of T7 RNA polymerase. Preferred eukaryotic hosts are the 
insect cells Spodoptera frugiperda (sf9 cells; see, e.g., Luckow et al. (1988) 
Bio/technology (5;47-55 and U.S. Patent No. 4.745,05 1 ). 

For insect hosts, baculovirus vectors, such as a pBlueBac vector (also 

25 called pJVETL and derivatives thereof), particularly pBlueBac III, {see, e.g., U.S. 
Patent Nos. 5,278,050, 5,244,805, 5,243,041, 5,242,687, 5,266,317, 4,745,051, and 
5,169,784; available from INVITROGEN, San Diego) may also be used for expression 
of the polypeptides. The pBlueBacIIl vector is a dual promoter vector and provides for 
the selection of reconibinants by blue/white screening as this plasmid contains the p- 

30 galactosidase gene (lacZ) under the control of the insect recognizable ETL promoter 
and is inducible with IPTG. A DNA construct is introduced into a baculovirus vector 
pBluebac 111 (INVITROGEN, San Diego, CA) and then co-transfected with wild type 
virus into insect cells Spodoptera frugiperda (sf9 cells; see, e.g., Luckow el al. (1988) 
5io//ec/i/ia/o£y 6. 47-55 and U.S. Patent No. 4.745,051). 

35 Other baculovirus vectors, such as pPbac and pMbac (available from 

Stratagene, San Diego, CA, see, also Lemhardt et al. (1993) Strategies 6:20-21, and the 
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Stratagene. Catalog page 218), which contain the htoian alkaline phosphatase (see eg 
Bailey .et al. (I989).Proc. Natl. Acad. Sci.^U. S. )i: 86:12-26) an^rn^ {see eg. 
Tessier- ?tiaI..(l99i):Gei,e P*; 177- 183) secretary signals insehed into tfie BamHI and 
,. ^. , . Ndel ^shes^crespecti vely of pJVP 1 OZ '{see; e:gfi KaW^oio et al. (1 ^9 1 ) Biochem. 
.5 .„ BiQphys. ;/?cxr.Commi/«2775A 756-63, ^^Je^a ^ 7-/0.267-272, are also 

„ suit^ble,for;usevherein7particularIy -if sebie^ deiirtd. Insertibn of genes into the 
, .r'^'na^^^^mHt sitesrof .these 'Vectors res^^^ ^1,^ 
..... insect, cell secretory pathway, which processes ■ the prd-polypeptide ^s^^ 

.peptide pr:.fusiorf protein is secreted into the groWili mediiim. Other heterologous signal 
10 sequences, such asi'thelinsulin'signal iSequence-(iee. e:^.^ for 
; DNA encoding the signal sequence); thef -growth hbrtiidn^ si^al Wquence, marr^ 

alkalixnc phosphatase, ;the.mellitin signal ^uence^an^^ others' that are processed by 
insecj.cells are used.- ■>■ /:■■"■■■ -''^^ \ '- 

DNA encoding full-length- HBEGF: HBEG>-gAP^ SAi^-riBEGF' with 
15 and without linkers, and other such constructs, has been introdiited^into the pET vectors 
^ pET 11a (Novagen, Madison, Wl). DNA encodirig SAP h^ klkb* been intiodwced in 
M pEi:, 15b. (Novagen, Madison,'iWI). ' i - ^ ■ . 

c> ' i!,n ; V Some of^the cdnstfucts provided herein hav^ also 1^^^^^ the 
1,.: bacjjlovii^ vector .sold.commercially urideP'^ dnyittogen;* San 

20 .I>iegO:-CA;;see the Invitrogeh tGataiog^ieeV Vialard Vir6i:^4]31:, see^ 

alsQ.^}J,Sr?at^nt:No. 5,270;458;r.U.S: PtoNb. 5,243,b4i;^ahd^,iblish^d internaiL^ 
PGT Application WO 93/1 01 39, which is ba^ed oh U.S. baterit apjlication Serial No. 
07/792.600. The pBlueBacIIt vectbr' is a' d»ial' promoter vect^ the 
. . selectipncof recombinants^nby . -blue/white contains the (3- 

25: galactosidase. gene (lacZ) under the ccntri>l' of the' insect Weogriizabie' ETL promoter 
and is inducible .with IPTG.. - The HBEGP cbnsthict or other construct i^ inserted into 
this yectoc under control of the polyhedrin p^mote^. the' DNA is Uen cotransfected. 
, - such as ;b> CaP04 transfection 'br liposomes. 'into Spodoptera jhigiperda cells .(sf9 
cells), with wild typeibaculovirus and grown in tissue culture flasks of in. suspension 
30,^> cultures rBlue occlusion- minus viral plaques are selected and plaque purified and 
. ..screened for the presence of HBEGF-cncoding DNA by aiiy standard methodology, 
such, as;v.estem blots using HBEGF anti-sera or Southern' blots using an appropriate' 
HBEGF, probe. DNA encoding an HBEGF witli and without linkers is introduced into a 
Bluebac vector for expression in baculovirus. Details ^e set forth in the Examples. 
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E. Methods of preparation of HBEGF-targeted agent conjugates 

Cytotoxic conjugates with linked targeted agents can be prepared either 
by chemical conjugation, recombinant DNA technology, or combinations of 
recombinant expression and chemical conjugation. The methods herein are exemplified 
5 with particular reference to HBEGF and saporin. It is understood, howeyen that the 
same methods may be used to prepare and use conjugates of any HBEGF polypeptide 
with any cytotoxic agent, such as a RIP, a nucleic acid or any other targeted agent either 
directly or via linkers as described herein. The growth factor and targeted agent may be 
linked in any orientation and more than one ^owth factor and/or targeted agent may be 

1 0 present in a conjugate. 

Conjugates that contain one or more HBEGF polypeptides linked, either 
directly or via a linker, to one or more targeted agents are provided. The presently 
preferred HBEGF polypeptides are those having sequences set forth in SEQ ID.NOs. 1- 
5. Human HBEGF is particularly preferred. 

15 The conjugates provided herein contain the following components: 

(HBEGF)n, (L)q, and (targeted agent)m in which: at least one HBEGF moiety is linked 
with or without a linker (L) to at least one targeted agent, n is 1 or mor^, typically is 
between 2 and 6, generally 1 or 2; q is 0 or more as long as the resulting conjugate binds 
to the targeted receptor, is internalized and delivers the targeted agent, q is generally 1 

20 to 4; m is 1 or more, generally 1 or 2; L refers to a linker, and the targeted agent is any 
agent, such as a cytotoxic agent or a nucleic acid^ or a dmg, such as methotrexate, 
intended for internalization by a cell that expresses a receptor to which H3EGF binds 
and upon binding is internalized. 

It is understood that the HBEGF and targeted agent (or linker and 

25 targeted agent) may be linked in any order and through any appropriate linkage* as long 
as the resulting conjugate binds to a receptor to which HBEGF binds and internalizes 
the targeted agent(s) in cells bearing the receptor. , . 

For example, the HBEGF polypeptide may be linked to the targeted 
agent or linker at or near its N-terminus or at or near its C-terminus. The HBEGF may 

30 be linked to a second HBEGF polypeptide, which may be the same or a different 
HBEGF polypeptide; and one or more targeted agents may be linked to the HBEGF or 
may be linked to each other. The linkage may be at any locus, although the C- or N- 
terminus region of HBEGF (within about 20^ preferably 10, amino acids from the 
terminus) is preferred. If more than one targeted agent is included, the second agent 

35 may be the same or different fi-om the first agent.. 
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^ In some embodiments, the conjugates provided herein may be 
- represented by the fonnulae (I): ' • ' - ^ - 

' ■ (HBEGFn<L)q-targeted agentmip^ ' ' ' ' 
n.::- . -r,. . ... ^ Which'HBEGF reifeans to a polypeptide that is reactive with a HBEGF 
-5 ^^"receptor (aiso refeired to hereiri-ats a HB£GF polype^^^^^ such as HBEGF, L refers to 

- • a:^ii^^^ be i^sefft br'abserit; q is 0 or more as long as the resulting 

- f cbhjugate binds'to a targets rec^ptdf and the targeted agenf is internaliK^ m. n and p 

"ar^vlridependehtiy, I'or nidre; ai^^ geherally less than' or equal to 4^ and preferably 1 or 
2' and the targeted agent is any agent, such as a cytotoxic agent or a nucleic acid, or.a 
10 drug, such as methotrexate, intended for internalization by a cell that " expresses a 
' HBEGF receptor; -Mid the MBEGF-ih^^ li'nker 'of targeted agent via its 

N-terminus or C-terminus or any 6ther locus in polypeptide," such W derivati cys 
residues. When p is 2, the conjugates are preferably linked via cysteine residues present 
or introduced into HBEGF. ' ' ' - • ■ / 

^ ^ " <^'°"j"8ates t)f the formula ('n):^^a^ 

which m, n, p and I are as defined above, are also i^rovided/ "These conjugates are 
-prepared by chemical conjugation or by preparing fijsion. proteins from DNA constructs 

> . that encode one or two' HBEGF mo^^^ j - ■ ■• • ' - ' • 

V '' ' " I" ^<ltliti^n. cbnjugaWs in^ 

20 - pdlypeptiaes and/or targeted dgent, iif the agent is a poiypepiide';^^^ 
' ! or other conservative substitution are provided. , Compositions of such 

• ' conjugates should exhibit reducieid aggregation compared to conjugates that contain non- 
essential cysteines. Non-essential cysteines may be identified em^^^ 

The iinicer i^ seleittfed to increase the . specifi toxicity, solubility, 
25 serum stability; and/dr jhtracelluiir availability of the/targeted moiety. More preferred 
'linkers are those that can be' incorporated In fusion proteins and expiressed in a host cell, 
such as E. coll. Such linkers include: enzyine substrates, such as cathepsin B substrate, 
' catheipsin D substrate, trypsin substrate, thronibin substrate, subtilisin substrate, factor 
' Xa substrate, aiad ^nterokinase substrate; linkers that increase solubility, flexibility. 
30 ' and/or intracellular cleavifbility, such ks (giymser)n and (serni^ly)n, in yvhich n is I to 6. 
■ preferably 1 to 4, most preferably 1, and m is 1 to' 6, preferably 1 to 4.' more 
preferably 4. Preferred amonjg such linkers, are Uiose, such as cathepsin D substrate, 
that are preferentially cleaved in the endosom'e or cytoplasm followihg internalization 
of the conjugate linker; dtlier such linkers; such as (glymser)n and (sermgly)n, also 
35 increase the flexibility, serum stability and/or solubility of the coiijugate or the 
availability of the region joining the HBEGF and targeted agent for cleavage. In some 
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embodiments, several linkers that are the same or different may be included in order to 
take advantage of desired properties of each linker. 

Other linkers are suitable for incorporation into chemically produced 
conjugates. Linkers that are suitable for chemically linking conjugates include 
5 disulfide bonds, thioether bonds, hindered disulfide bonds, and covalent bonds between 
free reactive groups, such as amine and thiol groups. These bonds are produced using 
heterobifunctional reagents to produce reactive thiol groups on one dr both of the 
polypeptides and then reacting the thiol groups on one polypeptide with reactive thiol 
groups or amine groups to which reactive maleimido groups or thiol groups can be 

10 attached on the other. Other linkers include acid cleavable linkers, such as 
bismaleimideothoxy propiane, acid labile-transferrin conjugates: and adipic acid 
diihydrazide, that would be cleaved in more acidic intracellular compartments and cross 
linkers that are cleaved upon exposure to UV or .visible light and linkers. 

The targeted agents or moieties include any molecule that, when 

1 5 internalized, alter the metabolism or gene expression in the cell . Such agents include 
cytotoxic agents, such as ribosome inactivating proteins DNA encoding cytotoxic 
agents, and antiscnse nucleic acids, that result in inhibition of growth or cell death. 
Other such agents also include antisense RNA. DNA, and truncated proteins thai alter 
gene expression via interactions with the DNA, or co-suppression or other mechanism. 

20 The conjugates herein may also be used to deliver DNA and thereby serve as agents for 
gene therapy or to deliver agents that, upon, transcription and/or translation thereof, 
result in cell deaUi. C)^otoxic agents include, but are not limited to, ribosome 
inactivating proteins, inhibitors of DNA, RNA and/or protein synthesis, including 
antisense nucleic acids, and other metabolic inhibitors. In certain embodiments, the 

25 cytotoxic agent is a ribosome*inactivating protein (RIP), such as, for example, saporin, 
although other cytotoxic agents can also be advantageously used. 

The targeted agents may also be modified to render them more suitable 
for conjugation with the linker and/or a HBEGF protein or to increase their intracellular 
activit\ . Such modifications include, but are not limited to« the introduction of a Cys 

30 residue at or near the N-terminus or C-terminus, derivatization to introduce reactive 
groups, such as thiol groups, and addition of sorting signals, such as (XaaAspGluLeu)n 
(SEQ ID NO. 50), where Xaa is Lys or Arg, preferably Lys, and n is 1 to 6, preferably 
1-3. ai, preferably, the carboxy-terminus {see, e.g., Seetharam et al. (1991) J. Bioi. 
C/ie/77. 266:17376-17381; and Buchner et al. (1992) AnaL Biochem. 205:263-270), that 

35 direct the targeted agent to the endoplasmic reticulum or the addition of a cytoplasmic 
sorting sequence, such as KDEL (see discussion herein). 
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- - i : Conjugates that contain a plurality of HBEGF polypeptides linked to the 

cytotoxic agent are also provided. These conjugates that cdntain several, typically two 
; to abftut six/monomers can be produceid by linking inultiple copies of DNA encoding 
. the .IJBEGF'.fusi6h .protiein CWder the citihtrol of a single promoter region. 

.= 5, ,In additionxonjugates-tfiai:'^^^^ 

SAP,HBEGF^SAPiiiiikfedwith orWithout linkfei-^W^ ' 
li ' ir> Cheliiiical cbiijig^^ " ' ' 

Preparation of HBEGF polypeptides for chemical 

' ,• -i'.' -""'''cbkijtiigfal^dn ■ ■■'-^ ' 

10 ,t . HBEGF nray Be isdiated frciih ^ siiitable' s^mce or may be produced 

using recombinant/DNJ^ meth6d6Jbgy; kistu^^^ /. .^,1 - .;■ r ... .. 

: . ; ! . : As used'hereitti ''siifeUihiiafly' jit^-^^^ 

appear free, of "readily idetiidtableihipuriUeS Ijy standard methods of 

analysis, . such as thin layer ' chrbmatography^ ^aLCy/ gef e^^^ high 
15 performance liquid chromatography (HPLC), iisQd by diose of siclli in' Ae art'to assess 
such purity, or sufficiently pure such that flirthCT pimficatibri wouW nm detectably alter 
r the physical arid chemical 'prbpertik; such as enzymatic 'knd b»ioloi^cal activitie^; of the 
i,; substance. ; Methods for purification bf'- the^ 'coinpouWds^ pr^^uce ' substantiallj 
chemically pure compounds' W kiibWri *b-ffibse Wsl^T^ flie .St"' A 
chemically pure compound m^y. hbWevbl--lib'^ mlxto^^ol^'^^SiS' In' such 
: instances, further liunfieatidrilmght ihb^^e tfie^^ciiic activity of S^ec^mpq^^^ 

f t ^ ' 'to effed chemical ''dd^^^ i5 
■••conjugated generally via a reactive afhin^ group of thiol group to agent or 

, ' to a linker,.which has bieen 6if is^iibseliu^htly linked to tfie targeted agent. The HBEGF 
25 polypeptide is coi^ugated- eitheir i^a 'i^ Nitei^ the 
polypeptide. In preferred embodimeiits, the Mbe6V polypeptide^ i^ a 
. reactive cysteihe residue to th^e linker or to the t^^etSi ageht. '"The HBEGF can also be 
^ modifiedtby: addition of a cysteine r^iduc'either by replacing a residue 'or by inserting 
the cysteine,-at or h^ar'the dmiiio 6r carboxyl temiiniis: within about 2^ preferably 10 , 
30. residues froM either end, arid pitferably at br liear the amino terminus. ' 
; • In: order to ifeduce the heterogeneity of preparations, the HBEGF 

polypeptide can be modified by riiutagenesis to i-eplace reactive cysteines, leaving, 
preferably, only? one available cysteine for reaction The HBEGF polypeptide is 
modified by deleting oi= Replacing a site(s) on the HBEGF that caVises the heterogeneity. 
35. : Such sites are' typicaiy cystine' residues that, up^^ foldihg bf the protein, remain' " 
available for interaction with other cysteines br for int(sractibn with more tiian one 
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cytotoxic molecule per molecule of HBEGF polypeptide. Thus, such cysteine residues 
do not include any cysteine residues that are required for proper folding of the HBEGF 
polypeptide or for retention of the ability to bind to a HBEGF receptor and internalize. 
For chemical conjugation, one cysteine residue that, in physiological conditions, is 

5 available for interaction, is not replaced because it is used as the site for linking the 
cytotoxic moiety. The resulting modified HBEGF is conjugated with a sinfgle' species of 
cytotoxic conjugate. Alternatively, the contribution of . each cysteine to the ability to 
bind to HBEGF receptors may be determined empirically as described herein. 

b. Preparation of targeted proteins for chemical conjugation 

10 If the targeted agent is a polypeptide it misiy be directly linked to the 

HBEGF or HBEGF with linker or to a linker by reaction of a reactive group in the 
polypeptide.. It is desirable, however, that the agent may react at only a single locus, so 
that the resulting preparation of conjugates is homogeneous. Thus, if necessary, the 
targeted agent can be derivatized and then a single species isolated, or can be modified 

.15 so that it only has one reactive group, such as a cysteine, for a particular set of 
conditions and reagents. For example, saporin has been derivatized and a single species 
isolated. Saporin has also been modified by introduction of a single cysteine residue. 

: For chemical conjugation,, the SAP may be derivatized or modified such 
that it includes a cysteine residue for conjugation to the HBEGF protein. 

20 Saporin for chemical conjugation may be produced by isolating the 

protein from the leaves or seeds of Saponaria officinalis or using recombinant methods 
and the DNA provided herein or known to those of skill in the art or obtained by 
screening appropriate libraries {see, e.g.. International PCT Application WO 93/25688, 
which describes the isolation of saporin, plasmids containing DNA encoding sajwrin, 

25 expression of saporin and isolation of purified saporin). Sonie DNA encoding saporin 
may also include an N-teraiinal extension sequence linked to the amino terminus of the 
saporin that encodes a linker so that, if desired, the SAP said linker can be expressed as 
a fusion protein. The sequence of DNA encoding saporin is set forth in SEQ ID Nos. 
8-12. 

30 The DNA molecules provided herein encode saporin that has 

substantially the same amino acid sequence and ribosome-inactivating activity as that of 
saporin-6 (SO-6), including any of four isoforms, vAnch have heterogeneity at amino 
acid positions 48 and 91 {see, e g,, Maras et al. (1990) Biochem. IrUernat, 27:631-638 
and Barra et al. (1991) Biotechnol Appl Biochem. 75:48-53 and SEQ ID NOs. 8-12). 

35 Other suitable saporin polypeptides include other members of the multi-gene family 
coding for isoforms of saporin-type RIPs including SO-1 and S03 (Fordham-Skelton 
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ttS^^ H : ^^^^'^^-'38), SO-2 W. e.,.. as: Application Serial 

n oo?! !! ; t ' <^o-esponds to GB 2.216,891; see, also, Fordham-Skelton et al. 
imi) .Gmv.C^;ieA.22P:460-466);SO-4 (^.^ 

. tappi et al, C1.9.85;^/acW ^V>M^. Commt^n. •y2P:934l942f^d SO-5"^ee eg. 
5,,,.:^194;24J;B; see^ralsd, Mbhtecfucchi et al (1^8^ M\/ 
, .JJ:263.267; and Kerreras et myi Bi6phys:^Bidch^,^} Acta 7275 3^-42) SO-4 
. which ,nclude^.the; N.tenninal:46^y^^^^^ set forth in SEQ ID NO 13 is isolated 
fi^tn the leavesvof &;«,narto q^c^a/«rby>extracUon With b.l M i^hbsi^h^ie buffer at 
.PH?.fqIlowed,by,<iialysis.ofthesupenia^^^ 
0^ jI^tive^ elMtion. from a negatively, charged -16,^ ^chahge^ .^sin. such as Mono S 

Ghemiwls, Swedenyusiftg 'a gradifenf of f t6 6:3 ^1 NaCf^d is the 
elutmg chrpinatographic faction that has -SAP activity. ' tht eluting 
. , fraction jsSQ-5-. .. . . 



15 



20 



.25 



30 



35 



,5 , B^ecause.fmore , than .one amino gbup' dn SAP ihay^ rfeact ' With the 

succm,m,dyl moiety, it,is,possible that more than one amino group oh the surface of the 

pi^tem IS reactive. This creates the potential for ftet<^6gefteity:^e^^^ 

,SAP IS used, This source of heterogeneity has- been ^olvfed by^the'cb&|ugktirig 'n^odif.ed 

, SAP.expressed in E, coli that has ah additional cysteihe iWferted in the coding sequence 

preferablyAvithiniO or<2P aminoacidso.^^ 

, - As discussed above.'^muteinsiofsapbrirfth^^^ a Cys at or near'Ue 

ammo pr,carlK,xyl teiminusxan be prepared/U 

•intrpduce a .sulfhydryl, the saporin can be modified by the ■ infroductibn of a cysteine 
residue intQ:,the>SAPisuehI..that;the resultingvmodified saporin protein reacts with a 
H^EGF ndonomer pif a linker (and.th^ to a HBEGF monomer) to prbduce a conjugate. 
- ' • i 'P^ferred loci ;for introduction of a cysiteine residue- include the N- 
termmus region, preferably < within about one to tWenty residues, more preferably one to 
.abQut ten.resi4ues. ftpm.thi? :N-teiminus ^of the cytotoxic agent, such as SAP For 
expression.of SAP in the bacterial host systems herein, it is ^Tso desirable to add DNA 
encodmg a methionine Imked to the DNA encoding the N-terminus of the saponn 
protein. E^NA encoding SAR.has.been modified Byrinsehing a DNA encoding Met-Cys 
(ATG TGT or ATG.TGG) at the N-terminus immediately adjacent to the codon for first 
residue pfthe mature protein. . . . 

. T ;. Muteins in which a cysteine residue has been added at the N-ferminus 
and mmeins jn which the amino acid at position 4 or tO has been replaced with cysteine 
have been prepared by modifying the. DNA encoding sapbrin (see. Example 3). TTie 
modified . DNA may be expressed, and: the resulting sapbrin protein purified as 
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described herein for expression and purification of the resulting SAP. The modified 
s^rin can then be reacted with an HBEGF. to form disulfide linkages between the 
HBEGF and the cysteine residue on the modified SAP. 

Typically, SAP is derivatized by reaction with SPDP. This results in a 
5 heterogeneous population. For example, SAP that is derivatized by SPDP to a level of 
0.9 moles pyridine-disulfide per mole of SAP includes a population of non-derivatized, 
mono-derivatized and di-derivatized SAP. Methods for isolation of mono-derivatized 
saporin are described, for example, in Lappi et al. (1993) Anal. Biochem. 2/2:446-451, 
copending U.S. Application Serial No. 08/099.924). The methods rely on the charge 
10 differences among the three species of SAP that arc produced upon reaction of one or 
more lysines in saporin with SPDP. The mono-derivatized saporin species is purified 
by Mooo-S cation exchange chromatogiaphy JuiU pooling of ihe fractions that contain 
the monoderivatized species. Briefly, as described in the copending application, the 
initial eluting peak is composed of SAP that is approximately di-derivatized; the second 
15 peak is mono-derivatized and the third peak shows no derivatization. The 
di-derivatized material accounts for 20% of the tivree peaks; the second accounts for 
48% and the third peak contains 32%. Fractions that have a ratio of SPDP to SAP 
greater tiian 0.85 but less than 1.05 are pooled, dialyzed against an appropriate buffer, 
such as 0. 1 M sodium chloride, 0.1 M sodium phosphate, pH 7.5, ased for coupling to a 
20 linker, to a HBEGF or to HBEGF witii linker. 

The resulting preparation, although more uniform, is heterogeneous 
because native saporin as purified firom the seed is a mixture of four isoforms, as judged 
by protein sequencing {see, e.g., copending published International PCT Application 
WO 93/25688 (Serial No. PCT/US93/05702), which is a continuation-in-part of 
25 copending United States Application Serial No. 07/901 ,718; see also, Montecucchi et 
al. (1989) Int. J. PepL Prot. Res. 55:263-267; Maras et al. (1990) Biochem. Internat. 
27:631-638; and Barraet al. {\99\) Biotechnol. Appl Biochem. 75:48-53). This creates 
some heterogeneity in tiie conjugates, since the reaction with SPDP probably occurs 
equally each isoform. This source of heterogeneity can be removed by using saporin 
30 expressed in £. co/i. 

c. Chemical conjugation of an HBEGF polypeptide to linkers 
and targeted agents 

The HBEGF polypeptides are preferably linked via non-essential 
35 cysteine residues to tiie linkers or to the targeted agenL HBEGF tiiat has been modified 
by introduction of a cys residue at or xicsa; one terminus; tiie N-teraiinus is preferred; is 
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used in chemical conjugation (see Examples for preparation of such modified HBEGF). 
Methods for coupling proteins to the linkers; such as the heterbbi functional agents, or to 
nucleic acids, or to proteins are known to those of skill in the art and are also described 
. herein.- • J -,.. " ■ vt-'-i-: ■. , •■:.>. .;■ : J' ■[ • • 

' '■■ • • ■ > 'Methodsfforchemical conjugation of proteins ^a^^ 
. X . in -the art; f illhe! 'preferred methods for Chemical conjugaition depend on the selected 
componehts/but preferably. riely on disiilfide bond formatibh: ' ' ' 

2. • Fusion p'i-otdn of ah HBEGF polypeptide and tWrgetbd agent 
• "Expression of DNA encoding a fiisidn of a 
10 the targeted ajgent results in'a hibre homogdnediK preparation of cytotdx^ 
- ' and is suitable for lisb; when the' sfelected targeting agent jaind linker are jpiiiypeptides. 
Aggregate foiinatiort'can'be iisduced irr prepaiauons^c^ the fiision plroteins by 

• modifying- the HBEGF, such as by removal of nonessential cysteines in th'e heparin- 
bindihg domain^ (amino acids 1-45) and/or the targeted' agent to pi^eveiit interactions 
15 between each cohjugateVis'uch ak -via liiii^c^ t iv . . . .^ 

■' a. -Ex^^ression of HBEGF " '"' " 
DNA encoding the hIBEGF poiypeptide may be isolated, synthesized or 
obtained from cdrrimefci^ herein in Example 4 and in 

^ " Ihternatibn^l Applifcatioh Wby92/66765 Jan^^ 

20 serial No. 07/598,082), and Abrah^n bit ^1/ (190^^ Comm. 
790. 125-133. Exjiression of Recombinant HBEGF pol^ may be performed as 

described hereihf imd DNA encoding HBEGF polypeptid^^^ be used as the starting 
materials for the methods hereiri^^ ; : ,1. , ' . : r . ! . 

DN A encoding HBEGF polypeptides and/o the amino acid sequences of 
25 HBEGFs are knbwn to those of skill in'thiya^ (see: e.g.. SEQ ID NOs. 1-5). DNA may 
be prepared synthetically based on the amino acid sequence or known DNA sequence of 
ail HBEGF or may be isolated using methods icnowri to those of skill in the art or 
obtained from' commercial br other sburces kiiown to those of skill in this art. For 
example, suitable niethbds iare described in Example 4 for amplifying HBiEGF encoding 
30 cDNA from well known plasmids (eg., pMTN-HBEGF; ATCC #40900 and pAX- 
HBEGF, ATCC #40899) containing HBEGF encoding cDNA. 

Such DNA may then be mutagenized using standard methodologies to 
delete or delete and replace any cysteine residues, as described herein, that are 
responsible for aggregate formation. If hecessar>', the identity of cysteine residues that 
35 coiitribute to aggregate formation may be determined empirically^ by deleting and/or 
deleting and replacing a cysteine residue and ascertaining whether the resuhing'HBEGF 
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with the deleted cysteine forms aggregates in solutions containing physiologically 
acceptable buffers and salts. Loci for insertion of cysteine residues may also be 
determined empirically. Generally, regions at or near (within 20, preferably 10 amino 
. acids) the C- or, preferably, the N-terminus are preferred. 
5 The DNA construct encoding the conjugate can be inserted into a 

plasniid and expressed in a selected host, as described above, to produce a recombinant 
HBEGF-toxin conjugate. Multiple copies of the modified HBEGF-cytotoxic agent 
chimera or modified HBEGF-cytotoxic agent chimera can be inserted into a single 
plasmid in operative linkage with one promoter. When expressed, the resulting protein 
10 will be an HBEGF-cytotoxic agent multimer. Typically two to six copies of the 
chimera are inserted, preferably in a head to tail fashion, into one plasmid. 

b. Preparation of muteins for recombinant production of the 
conjugates 

For recombinant expression using the methods described herein, up to all 
1 5 cysteines in the HBEGF polypeptide that are not required for biological activity can be 
deleted or replaced. Alternatively, for use in the chemical conjugation methods herein, 
all except for one of these cysteines, which will be used for chemical conjugation to the 
cytotoxic agent, can be deleted or replaced. Each of the HBEGF polypeptides described 
herein have six cysteine residues. Each of the six cysteines may independently be 
20 replaced and the resulting mutein tested for the ability to bind to HBEGF receptors and 
to be internalized. Alternatively, the resulting mutein-encoding DNA is used as part of 
a consdruct containing DNA encoding the cytotoxic agent linked to the HBEGF- 
encoding DNA. The constmct is expressed in a stiitable host cell and the resulting 
protein tested for the ability to bind to HBEGF receptors and internalize the c>totoxic 
25 agent. As long as this ability is retained the mutein is suitable for use herein. 

c. DNA constructs and expression of the constructs 

DNA encoding HBEGF conjugates is expressed in any suitable host, 
particularly bacterial and insect hosts. Methods and plasmids for such expression are 
set forth in the Examples (see, also TABLE 3). Using the methods and materials 

30 ' described above and in the Examples numerous chemical conjugates and fusion proteins 
have been synthesized. These include those set forth in TABLE 3 below. 

Particular details of the syntheses of the conjugates and DNA constructs 
are set forth in the Examples. The constructs have been prepared and have been or can 
be inserted into plasmids including pET 1 1 (with and without the T7 transcription 

35 terminator), pET 12 and pET 15 (INVITROGEN, San Diego), XpPL and pKK223-3 
(PHARMACIA, P.L.) and derivatives of pKK223-3. The resulting plasmids have been 
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and. can be transformed into bacterial hosts including BL21(DE3), BL2r(DE3)+pLYS 
S,~.H.MS174(DE3), HMS 1 74(DE3)+pLYS S (NoVageh/ Madison/ WI) and 
- - ■ :N4830(cI«57)i (see, .Gottesman et al. (1980) J. Mol. 7^0 57^75, commercially 

available from PL Biochanicals-Inc,' also, .see. e ;?;, U.S. Patenf Nos: 5,266,465, 
. ,;5- -A^ms f^a56,^^9^o:5^56J69y-''5^5-2jB; ' ■5^56*^96,*' 5,244.797, 5,236,828. 
■5^m29.v5,229,273/:4,798^^^^^^^ 

a heaYily >deletedi phage lamWda prbphage carrying the rtutaht cl S57 temperature 
^ sensitive repressor and an active ;N gene} The constructs h^ve also been introduced into 
. . a , bacylovinis: : vector sbld -eomirierijiany under the n^e pBLOEBACIII 
10. (INVITROGEN, San Diego CA; sfeeWe INVITROGEN CAiTALOG; see, also, Viaiard 
etal. (1990)^ riro/j=d^;37; Pateiif-N^^ 5:270^458; U.SfP^teiit No. 5,243,041; and 
published ^nte^lationa^^PCT Application- WO 93/10139;^ which is based on U.S. patent 
application Serial No. 07/792,600. The pBlueBscIll Vector is a dual promoter vector 
and provides for the selection of recomibinants by biue/white screening as this plasmid 
15 contains the p-galacitosidase gene (lacZ) imdeV the contfof of the insect recognizable 
ETL promoter, and 5s inducible with IPTG. the baculoviriis vector is," then co- 
; .:, i transfected; with wild. type virus into insect host cells^S^oi^op/era fi-ugiperda {sf9\ see, 
h:: i .e.g. Luckow et al. ( lc988) ^io/tecfmol6gy'6:47l55 iuid U.S; Patent Nb: 4;745,05 1 ). 









|PlksMidl(s) ithat 
1 Encode the 
IProtein*:**:.-, 


/iv i?-r'»:v^::i-^ -^^..j / : ^t- 

Description of Fusion Protein . . , ... 


Fusion . . 1 

Protein 

Name 


1 ~ ^ ;nm*;;;' ; 


^ ^ , . wild t)^e FGE chcmi^ ^ . 


CCFSl 


N/Asv-, .• 


r :. L mutein FGF C78S cfemical itinjugate* 


CCFS2 




mutein FGF C96S chemical conjugate 


gCFSS., 


N/fA.,../'. 


mutein FGF C96S Cys-SAP CYS-1 chemical 
ponjugate , ^ . , V , ; : ; : ^ 


CCFS4 


PZ1A,PZ1B, ' 

pzie. pziD, r 

PZIE,...,.; ; . ;,, 


- wild type (FGF-Ala-Met-SAP) fusion protein** 

.■T^.q • .-. 'v;- .... ...r:;. ' ^ 


FPFSI 


PZ50B1 


SAP CtS-l>CT ^ _ 


FPSl.^ 


PZ51B1 . 


SAPCMS+4 PET l laBL21(DE3> • - 


FPS2 
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Tyy C 1 IT 1 

rZ.^ 1 b I 


Q A P VQ-I>/1 PT^T 1 '> 1 /'T^P'^^ 
oAi 1 0^^** r ri 1 1 DD r>l^Z 1 l^iyn.^ ; 




rZJbZD 1 


CAP A PITT 1 1 o HT O 1 /'FlPlX 


ri i3-> ^ 


PZ52E1 


SAP CYS+10 PET 15b BL21(DE3) 


FPS3 


PZ30B1 


HBEGF PET 1 la BL21(DE3) 


FPHl 


py^^i Ri 

rjuj i £> 1 


HRFOF-Val-Met-SAPPETM a BLIKDESV 


FPHSl 


PZ32B1 


HBEGF-Ala-Met-SAP PETl la BL21(DE3) . 


FPHS2 


PZ33B1 


HBEGF-Ala-Met-TRYPSIN-Ala-Met-SAP 
r^x^ Ilia oi^z 1 vi^i^<> ^ 


FPHS3 


PZ34B1 


HBEGF-Ala-Met-CAT D-Ala-Met-SAP PETl la 


FPHS4 


PZ35d1 


nxStUr •vOiy4oerxoiy7^crx^iy4^^* J2^™^ 
PETllaBL21(DE3) 




PZ36B1 


SAP-AIa-Met-Ala-HBEGF PETl la BL2T(DE3) 


FFSHl 


rZ-J la I . 


PETllaBL21(DE3) 


FPSH2 

• 


PZ38I 


Met-Cys HBEGF Viral Stock 


FPH2 


PZ39I 


Met-Cys-Ala-Met-Ala-HBEGF Viral Stock 


FPH3 


PZ40I 


Met-Cys-Ala-Met-(Gly4Ser>2-Ala-Met-Ala- 
HBEGF Vira! Stock 


FPH4 


PZ41I 


Met-Cys-Ala-Met-(Gly4Ser)4.Ala-Met-Ala- 
HBEGF Viral Stock 


FPH5 



* Details regarding these constructs are described in U.S. Application Serial Nos. 
08/213,446 and 08/213,447, and PCT Appln. US 94/0851 1, filed July 27, 1994. 
N/A = not applicable 

5 ***The plasmids, such as PZlAl are designated with (i) a PZniunber (PZl), followed 
by (ii) a letter (A), and optionally (iii) followed by a number (1). The key to these 
designations: (i) PZnumber - refers to the construct number, (ii) the next letter refers to 
the plasmid into which the construct was cloned, A=pET 1 1 wthovrt the fl ^inscription 
terminator, B=pET 'll with the T7 transcription terminator. c=pET 13, p=pET 12, 

10 E^ET 15. F=XpPL, G=pKK 223-3, H=PRZ 1 (pKK223-3-!-Kan^). l=p6lueBac III, 
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J=PRZ2 (pKK223-3 + Kan^ + lad gene) and (ii) the optional number (or letter) refers 
to -the -bacterial strain (number) or insect host (letter) in which the plasmid was 
introduced, 1=BL21(DE3), ' 2=BL210E3)+pLYS S; 3=MMS17it(DE3), 
r 4=HMS174(DE3)+i^ S;5^483Q(cI8^76>^ V^,^] 

5: i Fusion proteins FPHS5 and FPSH2 are purified irom cell paste: ^Briefly, 

; cell paste is suspended in 3-4 volumes of Ull iysfe bUffe?' c^^ 
; citrate^ pH 6.0, 1 M^urpa, 5 i^jyl EbTA,r5 mM EGTA and>5iQ mM NaGL Thei lysate is 
% passaged, 3 times through a microfluidizer and diluted to 1 0 volumes with lysis buffer. 
The concentration of urea should be less thm S M to reduce viscosiiy^ arici it is not 
1 0 necessary to include, a.cpcktail of proteasei inhibitors. Urea :is: iiecessary for isojation of 
active prplein. The extract is loaded onto an. expanded bed of Streamline SP, cation- 
exchange resin equilibrated v.'ith lysis buffer. Pro^^^ wiiK.^ fc"^^^^^ 
containing increasing NaCl concentrations: the first buffer ycontain^ 0.25 M NaCl and 
the secorid buffer contains 0.8 M NaCl. The second eluate is diluted in buffer j^without 
, 15 NaCl and subjected to ariion-excha[rige chromatography 

DNA, endotoxins and contaminating proteins, and cation-exch^ge chromatography on 
SP-Sepharose to remove other contamin Proteinsi bpund to the S-Sepharose 

■ column are eluted with a gradient of 0.25 to 1 M NaCl in buffer Ammonium sulfate is- 
- added to the fusion pfoteins. As a pbsitiy^jsefec^ion,^ Ijie^ protein is loaded^ onto a 
20; phenyl.Seph2urpse HP column and . eluted . with „ buffer, containing 2 .M amrhoniiun 
sulfate. .Monothioglycerol is added, to the fusion protein.^ The protein is ldialyzed and 
subjected to size -exclusion chrpmatography* 6a No heparin affinity 

chromatography is, perform^ refolding pcQtocoj is not necessary to attain active 

. material in the case of conjujgates. : It willr be readily recognized that other equivalent 
25 resins and buffers may be readily substituted at each step in accordance vyith the 
purpose, of each purification step. , That is,, for. example, other equivalent cation 
exchange resins may be used in place of SP-Sepharose,V.rl . ' 

. . . . FPHS2 'and FPHSl fusion proteins are purified as above except that a 
: heparin sepharose FF affinity column was additionally used prior to the.S- 1 00 colimin.. 
30 F. • ' Properties and uise 'of the chemical conjugates and fusion prbteins ; / 

The conjugates provided herein can be used m v//ra'to identif>' cells, 
^' particularly tumor cells that express receptors to which the conjugate selectively binds 
and which iniemalizes the conjugates. The cells are contacted with the conjugates and 
. assayed for proliferation. Cells in which proliferation is inhibited e?cpress receptors to 
35 which ^HBEGF binds. If such cells are derived from a .tiimor, such tumor will be a 
J,. candidate for treatment with the HBEGF conjugate. If such cells are a cell line* such 
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cell line will be useful in drug screening assays for identification of compounds that 
modulate the activity of HBEGF receptors (see. e.g., U.S. Patent Nos. 5.208,145, 
5,071.773, 4,981,784, 4,603,106. which describe such assays for other receptors). 

Each of the HBEGF-containing conjugates produced by the methods 
5 described herein can be tested, using a variety of well known in vitro and in vivo assays, 
for their ability to exert a cytopathic effect. For example the Promega (Madison, WI) 
CellTiter 96 Cell Proliferation/Cytotoxicity Assay described above and in Example 4 
may be employed. In addition, in vitro cytotoxicity assays described in, for example, 
Kreitman et al. (1991) Bioconjugate Chem, 5,63-68; Epstein et al. (1991) Circulation 

10 5-^. 778-787, and the like, may be employed to test the conjugates produced herein. 

In another assay that may be employed EGF-receptor expressing cells 
are plated in 96-well tissue culture plates at 1 000-3000 cells/well in their respective 
medium. One day later, the medium is removed, and medium containing 1 pM to l^iM 
of the conjugate HBEGF-SAP, free SAP, free HBEGF, and HBEGF+SAP are added. 

15 Cells are treated in triplicate and maintained at 37^ C and 5% C02. Forty-eight hours 
after the treatment is initiated, the MTT colorometric assay is utilized to measure cell 
sensitivity to HBEGF-SAP conjugates (Mossman, T. (1983) J. Immunological Meth. 
65:55-63). Results are expressed as the mean optical density from treated wells, 
normalized to media controls, as a function of the HBEGF-SAP, free SAP, free 

20 HBEGF, and HBEGF+SAP concenU^tion. The 50% inhibition values are calculated 
from dose-i;esponse curves and represent the concentration which resulted in a 50% 
reduction in cell mmibcr. * 

G. Formulation and administration of pharmaceutical compositions 

As used herein, treatment means any manner in which the symptoms of a 
25 condition, disorder or disease are ameliorated or otherwise beneficially altered, reduced 
or relieved. Treatment also encompasses any pharmaceutical use of the compositions 
herein. ' 

, As used herein, amelioration of the symptoms of a particular disorder by 
administration of a particular pharmaceutical composition refers to any lessening, 
30 whether permanent or temporar>', lasting or transient that can be attributed to or 
associated with administration, of the composition. 

The conjugates herein may be formulated into pharmaceutical 
compositions suitable for topical, local, intravenous and systemic application. Effective 
concentrations of one or more of the conjugates are mixed with a suitable 
35 pharmaceutical carrier or vehicle. The concentrations or amounts of the conjugates that 
are effective requires delivery of an amount, upon administration, that ameliorates the 
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symptoms or treats the disease. Typically, the compositions are formulated for single 
dosage, administration. Therapeutically effective concentrations and amounts may be 
detominei.empiriGally .by testing the conjugates iVi known in vitro and in vivo systems, 
such as ihQse ..described :here^ dosages fdr hiimans ot other animals may then bJ 
^j^polajte^vtberefrcmr.-i. ' - ^ ' ' '' - ' ' " - • - - - 

, ; - ; . . t rJ?poh?a!nixing or ■ addition "of' the 'cohjugkteCs) with the. vehicle, the 
resultingrmixtuiie ihayibe a solution/suspension, eihulsion br the the form of the 
resulting TOixture depends -upon a- number of factors; inclWihg- the iiitended. mode of 
administration and the solubility of the conjugate iri the selected carrier or vehicle. The 
effective concentration is ^ufficieht • for • ameliorating ihfe ■symptoms of the ' disease, 
, disorder or eonditiort treated afid may -be empirically deterihiried based upon in vitro 
and/or in vivo data, sucxV « t&d data -fiom tlie mbiise Xeiidgr^ midel tpr tiunois or 
• rabbit ophthalmic model. ^ If .becessaiy; iihaiTna^^^^ other 
derivatives of the conjugates may-be prepared. ' ' ' ^ ' ' ' ' " ' ' ' 
^ ^ Phairihaceutical carriers 6r vehibles suitable ifor adminisu^tion of the 

conjugates provided herein include any such carriers knoWW^^^^ 
• - be.suitable for the particular mode of administration: In additidh, the conjugates may be 
r. . formulated as the sole phannaceUticaMy aetive ihgreditnt'in the 'cbmpositipn or may be 
combined with other active ingredifenfe^^^ Ay)ir:z-, o: v-is.; nrj. 

.20;,, .... .:r.As:;usea hfeireinv - i^harinabelifrM acbgptaby ' 'saKs,^ other 
derivatives ofthe conjugatesf infclude ai^y^salts; e^ttt^ of dfeiiVaWvw i^aV'ii^y be readily 
prepared by those of skill in this art using known method^' for siicH derivatization and 
that produce compoundsv that -.may ^be adminisier^a"^ to animals 6r 'hiimans without 
. , substantial toxic effects, and , that either are phanttact^iiUcally active or are prodrugs. A 
,25 prodrug is .a; compound; that. Upon in v/vo^adiftiiiistration, is metabolized or otherwise 
converted -to , the, biplpgically. pharmaceutically^r- thera^ of the 

compound. To produce a prodrug, the pharmaceutically active compound is modified 
.such that the active pompound will be regenerated by ineteboric processes. The prodrug 
may be designed to alter jhe^metabolic stability or the transport characteristics of a drug, 
30 . to mask side- effects or toxicity, to improve the flaVPr bf a drug or to' Wter othfer 
characteristics or properties of a drug. By virtufe of knowledge of phafmacodynamic 
proce^es and drug metabolism in vivo, those of skill in this art, once a phannaceutically 
active compound is.known, can design prodnigs of the compound (see, e.g.. Nogrady 
(1985) Medicinal, Chemistty .A .Biochemical Approach. Oxford University Press. New 
35; York, pages -388-392). - . ^ - i .• r ; . ' . : 
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The conjugates can be administered by any appropriate route, for 
example, orally, parenterally, intravenously, intradermally, subcutaneously, or topically, 
in liquid, semi-liquid or solid form and are formulated in a mannet suitable for each 
route of administration. Preferred modes of administration depend upon the indication 
5 treated. Dermatological and ophthalmologic indications will typically be treated 
locally; whereas, tumors and vascular proliferative disorders, will typically be treated 
by systemic, intradermal or intramuscular, modes of administration. 

The conjugate is included in the pharmaceutically acceptable carrier in 
an amount sufficient to exert a therapeutically useful effect in the absence of 

10 undesirable side effects on the patient treated. It is understood that nimiber and degree 
of side effects depends upon the condition for which the conjugates are administered. 
For example, certain toxic and undesirable side effects are tolerated when treating life- 
threatening illnesses, such as tumors^ that would not be tolerated when treating 
disorders of lesser consequence. 

15 The concentration of conjugate in the composition will depend on 

absorption, inactivation and excretion rates thereof, the dosage schedule, and amount 
administered as well as other factors known to those of skill in the art. 

As used herein an 2;ffective amount of a compound for treating a 
particular disease is an arnount that is sufficient to ameliorate, or in some manner 

20 reduce the symptoms associated ^Aith tJhe disease. Such amount may be administered as 
a single dosage or may be administered according to a regimen, whereby it is effective. 
The amoimt may cure the disease but, typically, is administered in order to ameliorate 
the symptoms of the disease. Repeated administration may be required to achieve the 
desired amelioration of symptoms. 

25 Typically a therapeutically efTcctive dosage should produce a serum 

concentration of active ingredient of fiDm about 0.1 ng/ml to about 50-100 ^g/ml. The 
pharmaceutical compositions typically should provide a dosage of from about 0.01 mg 
to about 100 - 2000 mg of conjugate, depending upon the conjugate selected, per 
kilogram of body weight per day. Typically, for intravenous or systemic treatment a 

30 daily dosage of about between 0.05 and 0.5 mg/kg should be sufficient and can be 
administered as a bolus or continuous infiision. Local application for ophthalmic 
disorders should provide about 1 ng up to 100 jig, per single dosage administration. It 
is understood that the amount to administer will be a function of the conjugate selected, 
the indication treated, and possibly the side effects that vsdll be tolerated. Dosages can 

35 be empirically determined using recognized models for each disorder. 
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The active ingredient may be administered at once, or may be divided 
, into.a number ofsmaller doses to be administered atinte^^^ It is understood 

that the precise dosage, and lunation .of treaunent is a function of the disease being 
,may,l?c..dcte|nMij«M ^ empirically. uiing khoWn testing protocols or by 
*. % ^^'^^^ m^-fV'irhYitro. twt data. It is td be' nbted that concentrkions and 
dosage values may also.v^^^ It to 

be further undeistQotf. that foi;, any particular subject, specific dosage fegiJnens should be 
adjusted oyer,time< aGcording, to the individual^need and the professional judgment of 
the pprson, administering or superyising the administration of the compositions, and that 
10.. the.concentratipn,ranges 5et ,.forth,herei and are 'not 'iritended to 
. liimt the scope or practi^^^^ : . 

. .. , i Solutiops or> suspensions, used for- parenterals iiiiradcmial, subcuUuicou^i, 

or topical application can include any of tiie followng components: a stenl^ diluent, 
such as water for injection, saline solution, fixed oil, polyethylene glycol: glycerine. 
.15 propylene glycol or other synthetic solvent; antimicrobial agents! such as benzyl alcohol 
and methyl parabens; antioxidants, such as ascorbic kid and sodiiun bisulfite; Chelating 
agents, such as ethylenediaminetetraacetic acid ?(EDTA); bufferi, sudh as acetates, 
v"^ ^-'— phosphates; and agents i for the adjustment of tonicity such as sodium 
3.l:u,^.5^^e'^^*^^o^^^«t^ose.^^^ 

.20,, syringes orniulUple dose, vials madeof glass, plastic or other 'suitablfe^m'^teria^ 

...... If .administered, intravenously, suit^ble^^^ include physiological 

... . saline or phosphate buffered isalihe (PBS), and solutions containing thickening and 

. . solMbilizing agents,: s^h as.glucose,; polyethylene glycol, and polypropylene glycol and 
mixtures thereof Liposomal suspensions may also be suitable as pharmaceutically 
25 acc^table carriers. .TJiese n^y be prepared according to methods known to those 
skilled in; the art., ' -• i- • ■ ■•r..' 

! ;The';Conjugates may be prepared' with caniers that protect thehi against 
: . ' . f^P'^: eliinination fi;om the bo,dy, such as. time release formulations or coatings. Such 
. .carriers include cor»u-olled release formulations, such as, but not limited t6: implants and 
3,0. microencapsulated.deliveiy systems, and biodegradable, biocompatible polymers, such 
, as . ethylene vinyl acetate, polyanhydrides. polyglycolic acid, polyorthoesiers, 
polylacetic acid and others. These are particularly useful for application to the eye for 
ophthalmic indications ^ following or during surgery in which ^ only a single 
administration is. possible., Methods for preparation of such fonnulatiohs are known to 
35 those skilled in the art,, ..... - , i i: . ! . , 
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The conjugates may be formulated for local or topical application, such 
as for topical application to the skin and mucous membranes, such as in the eye, in the 
fomi of gels, creams, and lotions and for application to the eye or for intracistemal or 
intraspinal application. Such solutions, particularly those intended for ophthalmic use, 
5 may be formulated as 0.01% -10% isotonic solutions, pH about 5-7, Avith appropriate 
salts. The ophthalmic compositions may also include additional components, such as 
hyaluronic acid. The conjugates may be formulated as aerosols for topical application 
{see, e,g,. U.S. Patent Nos. 4,044,126, 4,414,209, and 4,364,923). 

If oral administration is desired, the conji^ate should be provided in a 
10 composition that protects it from the acidic environment of the stomach. For example, 
the composition can be formulated in an enteric coating that maintains its integrity in 
the stomach and releases iiic aciivc compound in the intestine. The composition may 
also be formulated in combination with an antacid or other such ingredient. 

Oral compositions will generally include an inert diluent or an edible 
15 carrier and may be compressed into tablets or enclosed in gelatin capsules. For the 
purpose of oral therapeutic administration, the active compoimd or compounds can be 
incorporated with excipients and used in the form of tablets, capsules or troches. 
Pharmaceutically compatible binding agents and adjuvant materials can be included as 
part of the composition. 
20 The tablets, pills, capsules, troches and t|ie like can contain any of the 

following ingredients, or compounds of a similar nature: a binder, such as 
microcrystalline cellulose, gum tragacanth and gelatin; an excipient such as starch and 
lactose, a disintegrating agent such as, but not limited to, alginic acid and com starch; a 
lubricant such as, but not limited to, magnesium stearate; a glidant, such as» but not 
25 limited to. colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; 
and a flavoring agent such as peppermint methyl salicylate, and fruit flavoring. 

When the dosage imit form is a capsule, it can contain, in addition to 
material of the above type, a liquid carrier such as a fatty oil. In addition, dosage unit 
forms can contain various other materials which modify the physical form of the dosage 
30 unit, for example, coatings of sugar and other enteric agents. The conjugates can also 
be administered as a component of an elixir, suspension, syrup, wafer, chewing gum or 
the like. A symp may contain, in addition to the active compounds, sucrose as a 
sweetening agent and certain preservatives, dyes and colorings and flavors. 

The active materials can also be mixed with other active materials that 
35 do not impair the desired action, or with materials that supplement the desired action, 
such as cis-platin for treatment of tumors. 
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. : Finally, the compounds may be packaged as articles of manufacture 

: ; containing-.packaging material, one or more conjugates or compositions as provided 
: , herein within the packaging material, and a label that indicates the indication for which 
■ .•■.^■U^eeonjugate>is.prdvided'. • 'i'?""^ 

5 H:;. v;Ther*peu^^^ ■ ' 

- . THt' conjiigiates providi^ci herein can be useci in phaniiaceutical 

compdi5ifions%ij^t' HBEGF-ra^ to 
cells that bear HBgGF¥edep(t6rS and iiftibitift'g^^^^^ causing death of the 

. ■ cells? SUcH jjathophysiologicai cdnditibhs- for i^x^ple, certain tumors such as 

10 renal cell carcinofhai' and bre^t and' todeV tiijii^is, psoriasis:' b^^^ 

- invoMng epithelial cais,''sU^^ jens clouding. 

Tne treatment^ is eiTecied by ad^ihisteriHii 'mer^^ui^^^^^^^ of the 

HBEGF conjugate; for lexiample; in a physi&ldgfiC^I vehiic^e siiitaible fo^ local of systemic 
application. ' In 'particular;- for tiieatment of 1 obelized s^n diwirders the conjugate is 
15 formulated for topical, Ibtal or intralesioh^l aj^^^^^^ the skin and is applied 

topically, Idcadly or intrkle'sibhal. ^ ' ' >o ^ - & - > . - 

1.' Trtatnieht of pathbphysibiogical smo 
' ' Atherbscrefbsi's; wHicfe results i&om the dWeiopment of aij intimal lesion 

and the subsequent narrowing of the vessel lumen, conunoiily re^^^ buildup 

20 ^ of" plaque which liries the jjarticularly the arteries. ^ In recent 

years, a liuimber 6f surjgicai i>roceclufes liave been deveioped that interaiteriajly remove 
such plaque, often by balloon ca^^^ oi- bther such treatments, either by 

' ' compressing it agaihSt or scraping it away ^ f^^ the irrterior surface of the arten-. Not 
infi-equehtly, the patient so tfeateci experierices ja recurrence pf narrowing of the, vessel 

25 liiineh in a relatively short period thereafter. This narrowing following tt^atinent to 

rembve jiiaqueisri^ferilato^resteno^^ 

Methods are provided herein for treating restenosis by adniinistering an 
effective istfftbunt of ah HEfEGF cytotoxic conjugate, such, that the HBEGF conjugate 
inhibits smooth muscle cell proliferation in the lining of vessels that have been injured 

30' ' without inhibiting pfbliferatibn of endbtheliaj cells that is necessary for preventing or 
treating restehos^^ It can be administered' locally or 

intravendusi>\ A medicament containing an HBEGF-toxih/ preferably saporin. 
conjugate will be targeted to proliferating smootfi muscle cells in the treated arteries and 
relatively f^w infiisioris (or a few, le : up to about 3-5) should prevent restenosis. 

35 Preferably, the medicament containing the conjugate is administered 

intravenously (IV), although treatment by localized administration of the conjugate may 
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be tolerated in some instances. Generally^ the medicament containing the conjugate is 
injected into the circulatory system of a patient in order to deliver a dose of cytotoxin to 
the targeted cells by first binding the conjugate to high HBEGF receptors expressed by 
such cells. 

5 The efficiency with which a cytotoxin, such as saporin or a Ricin A 

chain or a similar RIP, can inhibit protein synthesis and consequently interfere with 
DNA synthesis is fairly widely known. Accordingly, the dosage of the conjugate that is 
administered will, to some extent, depend upon the particular cytotoxin chosen; 
however, doses of the conjugate in the general range of about 0.01 mg to about 100 mg 

10 of the conjugate per kilogram of body weight are expected to be employed as a daily 
dosage. There may be particular advantages in administering a daily dosage of about 
0.1 mg/kg (i.e. between 0.05 and 0.3 mg/kg). 
2. Treatment of tumors 

Tumors, particularly solid timiors, including bladder, breast, ovarian, 

15 pancreatic and some colon carcinomas, have receptors to which HBEGFs bind. The 
susceptibility of particular tumors can be ascertained by isolating the cell, and 
contacting them with an HBEGF cytotoxic conjugate and determining sensitivity to the 
conjugate by a standard proliferation assay. This should identify those tumors that 
would be amenable to treatment and also identifies tumor cells that express receptors to 

20 which HBEGF binds. Cytotoxic conjugates, such as HBEGF conjugated with the 
saporin molecule (HBEGF-SAP), are inhibitors of cell growth in vitro for cell lines that 
express HBEGF receptors. Such in vitro activity should be extrapolatable to in vivo 
activity. In vivo activity may be assessed using recognized animal models, such as the 
mouse xenograft model for anti-tumor activity (see, e.g.. Beitz et al. (1992) Cancer 

25 Research 52:227-230; Houghton et ah (1982) Cancer Res, ¥2:535-539; Bogden et al. 
(1981) Cancer (Philadelphia) ¥5:10-20; Hoogenhout et al. (1983) Int, J. Radiat. OncoL, 
Biol Phys. 9:871-879; Stastny et al. (1993) Cancer Res. 53:5740-5744). Cell lines that 
arc sensitive to the cytotoxic HBEGF conjugates can be grown subcutaneously as solid 
tumor xenografts in nude mice, and administration of HBEGF-SAP conjugates to such 

30 mice should show nq^id reduction in tumor volume in those cell lines which responded 
to treatment of the conjugate. 

Treatment of mammals, including human patients, would be similarly 
effected by administering a thenq>euiically effective amount of the HBEGF conjugate in 
a physiologically acceptable carrier. Specifically, in the treatment, the conjugates are 

35 used to target cytotoxic agents to hiunan solid tumors, including bladder or breast 



wo 96/08274 



PCT/US95/12205 



69 



tumors, to inhibit the proliferation of such cells. The conjugates are also used to target 
HBEGF receptor-expressing cells in similar tumorigenic path6p>hysiological conditions. 

The following examples are included for illustrative purpbses only and 
are not intended to limit the scope of the invention. ' ' 
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EXAMPLE 1 

Recombinant PRODucrioN Of Saporin 

5 The manipulatiDns described in this example are also described in 

International PCT Application WO 93/25688 and copending U.S. Applications Serial 
Nos. 08/145,829 and 07/918,718. 
, A. Materials and methods 
1. Reagents 

10 Restriction and modification enzymes were purchased from BRL 

(Gaithersburg, MD), Stratagene (La JoUa, CA) and New England Biolabs (Beveriy, 
.MA). Nalive SAP was obtained from Saponaria officinalis {see, e.g., Stirpe et al. 
(1983) Biochem. J, 276:617-625). Briefly, the ^eeds were extracted by grinding in 5 
mM sodium phosphate buffer, pH 7.2 containing 0.14 M NaCl, straining the extracts 

15 through cheesecloth, followed by centrifugation at 28,000 g for 30 min to produce a 
crude extract, which was dialyzed against 5 mM sodium phosphate buffer, pH 6.5, 
centrifuged and applied to CM-cellulose (CM 52, Whatman', Maidstone, Kent, U.K.). 
The CM column was washed and SO-6 was ehited with a 0-0.3 M NaCI gradient in the 
. phosphate buffer. 

20 2. Bacterial Strains 

E coli strain JA221 (Ipp* hdsM-i- trpE5 ieuB6 lacY recAl F[lacl^ lac"*" 
pro"*"]) is publicly available from the American Type Culture Collection (ATCC), 
Rockville, MD 20852, under the accession number ATCC 33875 (JA221 is also 
available from the Northem Regional Research Center (NRRL), Agricultural Research 

25 Service, U.S. Department of Agriculture, Peoria, IL 61604, under the accession number 
NRRL B-15211; see, also, U.S. Patent No. 4,757,013 to Inouye; and Nakamura et al. 
(1979) Cell 7*;1I09-1 117). Strain INVla is commercially available from Invitrogen, 
San Diego, CA; and strains Novablue and BL21(DE3) are commercially available 
(Novagen, Madison WI). 

30 3* DN A Manipulations 

The restriction and modification enzymes employed herein are 
conunercially available in the U.S. Native saporin and rabbit polyclonal antiserum to 
saporin were obtained as previously described in Lappi et al. (1985) Biochem, Biophys. 
Res, Comm. /2P;934-942. Ricin A chain is commercially available from SIGMA, 
35 Milwaukee, WI. Antiserum was linked to AfFi-gel 10 (BIO-RAD, Emeryville^ CA) 
according to the, manufacturer's instructions. Sequencing was performed using the 
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Sequenase kit of United States Biochemical Corporation (version 2.0) according to the 
manufacturer's instructions. Minipreparation and maxipreparation of plasmids, 
preparation of competent. cells, transformation; Ml 3 manipulation, bacterial media. 
Western blotting, and ELISA assays were according to Sambrook et al. ((1989) 
5 Molecular Qoning-^AvLabbratory^Mdm^^ Laboratory Press, Cold 

!./ ii Spring Harbor,.Ny).r: Purificatiton of DNA iragmrats was athieveVi usirig the Geneclean 
II kit, purchased from Bio 101 (LaJolla,CA). • • 

4. Sodium dodecyl sulfate (SDS) gel elect^<iph6resis and Western 
blotting. ., _ 

}^ , SDS gel ^Ipp^ophoresis was performed on a PhastSystem utilizing 20% 

— ?*^'^^r, : Wes*en?,.blotting ^^^^ accomplished; by -transfer of electrbphoresed 
protein to nitrocellulose using th^,PhastT by 
the manufacturer: The antiserum,to SAP was used at a dilution of i: 1 000. Horseradish 
peroxidase labeled anti-IgG was used as. the second antibody as described ^(Davis, L., 
\^ . Dibner et al. (1986>) Basic Methods in MoleculanBioldgy, jyp.> l-338V -Elsevier Science 
Publishing Co., New York). • : = j. -^^ > v...: 'i-; - a-.^ . n/ % v. ,-. , 

• .5. , Celi-fi-ee assay for cytotoxic activity ' ' . ' v^^h'. n 
, .; ' , ' : = ilTie RJR activity of saporin xiaii be and is determift^^ 

measuring cell-free protein synthesis in a nuclease-treated rabbrt ' ^etliciifocy^e lysate 
20 (Promega). Samples of saporin are added on iqestOE^Sfulof^^abbit reticulocyte ly'^te 
, ..Mid 10 ^1 of a enaction mixture containing/O.S tiJ of Brome , Mosaic Virus RNA, 1 mM 
, amino acid mixture min;us,4eucine, SjfjCi of tritiated leucine and 3 yd of Water.n Assay 
tubes are inpubated l hpur. in a, 30 C water bath. The reaction is stOpped by transferring 
the tubes to ice and a(iding,,5 fil of assay mixture, in triplicate, to 75 ^il of 1 N 
25 sodium hydroxide, 2.^%. hydrogen peroxide .in the wells .ot a Millititef HA 96-well 
J filtration plate (MiHipor?).; Wh^n the red color has bleached froin the samiples, 300 \i\ 
.,pf ice cold 25% trichloroaceuc.aci4 (T^ added to each well and the plate left on 
ice fpr another 30 niin. . Yacuuni filtration is performed with a. Milliporc vacuum holder. 
The wells are washed three times with 300 ^l of ice cold' 8% TCAl ' After drying, the 
30 filter paper circles are punched out of the 96-:weII plate- iuid counted by liquid 
scintillation .techniques. The ICsp for ^recombinant, and native saporin is approximately 

■' '20' pM. ' \ ...... 

B. I.solation,o,f DN4 encoding saporin ^. . > . , 

1- : Isolation of genomic DNA and preparation- of amplification primers 

35. jSaponaria officinqUs \t3{ g^ 

Bianchi et al.; {\9%Z) Plant MoL.BioL IJ :203^2}4? Primers for genoinic DNA 
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amplifications were synthesized in a 3 SOB automatic DNA synthesizer. The primer 
corresponding to the "sense" strand of saporin (SEQ ID NO. 27 includes an EcoR I 
restriction site adapter immediately upstream of the DNA codon for amino acid -15 of 
the native saporin N-temiinal leader sequence (SEQ ID NO: .27): 5'- 
5 CTGCAGAATTCGCATGGATCCTGCTTCAAT-3\ The primer 5'- 

CTGCAGAATTCGCCrCGTTTGACTACTTTG-3* (SEQ ID NO; 28) corresponds to 
the "antisense" strand of saporin and complements the coding sequence of saporin 
starting from the last 5 nucleotides of the DNA encoding the carboxyl end of the mature 
peptide.. Use of this primer introduced a translation stop codon and an £coRI restriction 

10 site after the sequence encoding mature saporin. 

2. Amplification of DNA encoding saporin 

Un£ractionated Saponaria officinalis leaf genomic DNA (1 ^il) was 
mixed in a. final volume of 100 fxl containing 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 
0.01% gelatin, 2 mM MgCl2, 0.2 mM dNTPs, 0.8 ^g of each primer. Next, 2.5 U TaqI 

15 DNA polymerase (Perkin Elmer Cetus) was added and the mixture was overlaid with 
30 |il of mineral oil (Sigma). Incubations' were done in a DNA Thermal Cycler 
(Ericomp). One cycle included a denaturation step (94 C for 1 min.), an annealing step 
(60 C for 2 min.), and an elongation step (72 C for 3 min.). After 30 cycles, a 10 jal 
aliquot of each reaction was run on a 1 .5% agarose gel to verify the correct size of the 

20 amplified product. 

The amplified DNA was digested with £ct>RI and subcloned into EcoRl 
I-restricted M13mpl8 (f4EW ENGLAND BIOLABS, Beveriy, MA; see, also, Yanisch- 
Perron et al. (1985), "Improved M13 phage cloning vectors and host strains: 
Nucleotide sequences of the M13mpl8 and pUC19 vectors' . Gene 33:103). Single- 

25 stranded DNA from recombinant phages was sequenced using oligonucleotides based 
on internal points in the coding sequence of saporin (see, Bennati et al. (1989) Eur J, 
Biochem. 75i,-465-470). Nine of the M13n^l8 derivatives were sequenced and 
compared. Of the nine sequenced clones, five had imique sequences, set forth as SEQ 
ID NOs. 8-12, respectively. The clones were designated M13mpl8-G4, -Gl, -G2, -G7, 

30 and -G9. Each of these clones contains all of the s2^)orin coding sequence and 45 
nucleotides of DNA encoding the native saporin N-terminal leader peptide. 
C. pOMPAG4 Plasmid Construction 

Ml 3 mpl8-G4, containing the clone containing saporin of SEQ ID 
NO. 8 from Example I.B., was digested with EcoR I, and the resulting fragment was 

35 ligatcd into the EcoR 1 site of the vector pIN-IIIompA2 (see, e g., see, U.S. Patent NO 
4,575,013 to Inouye; and Duffaud et al. (1987) Meih. Em, 755:492-507) using the 
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. methods.described in Example 1. A. The ligation was accomplished such that the DNA 
.encoding . saporin, including the N-tenninal extension, was fused to the leader peptide 
segment of therbacterial ompA gene. The resulting plasmid pOMPAG4 contains the 
Ipp promoter (Nakai?iura.et.al.. (1979) 'Ce^r 75. 1 109-1 117): the £. coli lac promoter 
5 operator sequence (lac O). and the £.• coli omijA gene secretion signal in operative 
, association with each other and \vith the - saporin and native N-terminkl leader-encoding 
; DNA listed in .SEQ ID NO 8. The plasmid also includes the r co/i lac repressor gene 

. ...(lacl)... . . ... •:. -.. ' .•" 

' - , , The M13 mpl 8^G1 , -G2- -G7; and -09 clones obtained from Example 
10 1.B.2, containing SEQ ID NOs. 9-12, respectively^ are digested with £coR 1 ind ligated 
into EcoR I digested ;pIN-IIIompA2 as described for M13 mpl8-G4 above in this 
example. ^.The^ resulting plasmids, labeled pOMPAGl2;rpOMPAG2. pOMPAG7, 
. pOMPA9, are • screened; expresse^dv. purified, and ; characterized as described for the 
. ::plasmid pO|ViPAG4. t. c . . -. ' . ; • . • . . 

' ^ . : V 1 a competent cells were transformed With pOMPAG4 knd cultures 

. containing (the desired plasmid sUncture were grown further in-order 'to obtain a large 
. r, preparation of isolated pOMPAG4plasmid.using methods described in Example I.A. 
. Pr ; Saporin expression in £1 co/f - i^^^^^^^ ^: - - x'.":: 1 >: 

' - - ' ' ?"he pOMPAG4:-transformed >£. bo// ce^ 

20 in which the expression of the saporin-containing protein is' repressed by the lac 
< . . repressor to an. O.D. in or at ihe end of the log phase of growth after which IPTO was 
added tq induce expressiori'of the saporiri-encoding DNA. ■ " ' ' •■ 
• ' ' • To 'generate a large-batch culture of pOMPAG4 transformed £. coli 

cells, an- overnight, culturfe (lasting approximately 16 hbui^s) of JA22I £. coli cells 
.25 transformed with the plasmid-pOMPAG4 in LB broth (jee. e g.; Sambrook et al. (1989) 
„ Molecular. Cloning: A- Laboratory Mahudh Cold Spring flarbbr Laboratory Press, Cold 
, .V » ; Spring Harbor. NY) containing 125 mg/nil ampitillin wa!s diluted 1:100 into a flask 
. :,.cpntaining 75.0 ml LB broth with 125 mg/ml ampicillin. Cells w6re grown to 
logarithmic phase with shaking at 37^ C until the optical density at 550 nm reached 0.9. 
•'^ ;f 1" the second step: saporin expression was induced' by the addition of 

IPTG (Sijima) to a . final concentration of 0.2 mM; Induced cultures were grown for 2 
additional hours and then harvested by centrifugation (25 min., 6500 .x g). The cell 
pellet was resuspended in ice cold l .OM-.TRISjtpH 9.0, 2 mM EDTA (10 ml were 
. added to each gram of pellet). The resuspended -material was kept on ice for 20-60 
.35 , minu».es and then centrifuged(20 min;, 6500 x g) to separate the periplasmic fraction of 
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£. coli, which corresponds to the supernatant, from the intracellular fraction 
corresponding to the p>ellet. 

As described below, (see, Exaimple 3), it has been found that it is 
preferable to perform the manipulations previously conducted at 37^ C at 30^ C. 
5 E. Assay for cytotoxic activity 

The RIP activity of recombinant saporin was compared to the RIP 
activity of native SAP in an in vitro assay measuring cell-free protein synthesis in a 
nuclease-treated rabbit reticulocyte lysate (Promega). Samples of immunoaffinity- 
purified saporin {see, e.g., Lappi et al. (1985) Biochem. Biophys. Res: Comm. 729:934- 
10 942) were diluted in PBS and 5 )x\ of sample was added on ice to 35 |il of rabbit 
reticulocyte lysate and 1 0 |xl of a reaction mixture containing 0.5 fil of Brome Mosaic 
Virus UNA, 1 mM amino acid mixture minus leucine, 5 \xC\ of tritiated leucine and 3 ^1 
of water. Assay tubes were incubated 1 hour in a 30 C water bath. The reaction was 
stopped by transferring the tubes to ice and adding 5 ^1 of the assay mixture, in 
15 triplicate, to 75 \i\ of 1 N sodium hydroxide, 2.5% hydrogen peroxide in the wells of a 
Millititer HA 96-well filtration plate (Millipore). When the red color had bleached from 
the samples, 300 \i\ of ice cold 25% trichloroacetic acid (TCA) were added to each well 
and the plate left on ice for another 30 miri. Vacuum filtration was performed with a 
Millipore vacuimi holder. The wells were washed three times with 300 \i\ of ice cold 
20 8% TCA. After drying, the filter paper circles were punched out of the 96-weIl plate 
. and cotmted by liquid scintillation techniques. 

The IC50 for the recombinant and native saporin were approximately 20 
pM. Therefore, recombinant saporin-containing protein h<is full protein synthesis 
inhibition activity when compared to native saporin. 

25 

EXAMPLE 2 

Preparation Of Starting Plasmids - PZIA, PZIB, PZIC AND PZID 

30 . 

A. General Descriptions . 

1. Bacterial Strains and Plasmids: 

£. coll strains BL21(DE3). BL21(DE3)pLysS, HMS174(D'E3) and 
HMS174(DE3)pLysS were purchased from Novagen, Madison, WI. Placid pFC80, 
35 described below, has been described in the WIPO International Patent Application No. 
WO 90/02800, except that the bFGF coding sequence in the plasmid designated pFC80 
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herein has .the ^quence set forth as SEQ ID NO. 34, nucleotides 1-465. The plasmids 
described herein may be prepared using pFC80 as a starting material or, alternatively, 
1. with a fragment.<»n;aining:t^^ 19) 

linked to the F.GF-^nNcodjingjD?vJA;.(SEQ iD NO. ,34). . 
5 2. DNA Manipulations , * / - i ' . 

. , . . .Tlife . restriction . and; modifi^^ employed here are 

, : . cprmiierciaJI;^, ^vaii^le in,tHevU.S. Native SAPv chemically conjugated bFGF-SAP and 
rabbit polyclon^ antiserum.to SAP were obtained as described, for example, in Lappi et 
al.,(1985) Biochiem^ Biophys. Res. Comm. /2P. 934.4942, Lappi et al:^ (1989) Biochem. 
10 Biophys.. Res: Cpmm:J6Q:^M^,zxi^ U.S. Patent.Wo. 5,191,067. Iht pEt Systfem 
: ^a§ ;Purphased fi^^^ WI,; The sequencing of the 

differenl constructions, was^dpne^ using riie Sequenase kit of ^ United States Biochemical 
Corporation (version 2.0), rMiniprep&cation , and-^ maxiptteparations of plasmids, 
preparation Pf competent cells, trMsfomiation; Ml 3 ;manipula^ bacteriat wedia and 
1 5 Western blotting were performed using routine i methods {see, e.g.] .Sambrook et al. ^ 
: (1989) Molecular Cloning. A Laborafory' Manual ■ Cold Spiingm^aboT Laboratory' 
, . . Press„Gold Spring Harbor,, NY). The purificatiori of DNA fragments was done using 
- .9^"«<^'?^ ^'V purchased froraBio 3 Ql ' SDS g was performed' 

. ._on..a Phastsystem.(Pharmaqw^^ -^-tv.' s . ^:,!;>'nd : r.^r -i 

■20 ■ v^ 3;- -t-/. ',Cytotoxicity:«ssays.of conjugaitfir- ^ . :.: 'J... 

Cytotoxicity experinwjnts i^ei performed With:: t^ 1Pr6m<;ga (Madison, 
, . :W1) GellTiter.96 Cell Proliferation/Gytotoxicity Assay; Cell types are A431 or SK- 
MEL28cellsi 2500 cells are plated perwellv'i ' .' ; ■ . " :. 
B. Construction of plasmids encoding FGP-SAP fusion protein^ 

2^ 1- Construction of FGFM13 that contains DNA encoding the CI 

ribosome binding site linked to FGF 

A Nco I restriction site wasi introduced into the SAP-encoding DNA of 
the M13mpl8-G4 clone, described in Example 1, by site-directed mutagenesis method 
using the AmcTsham^n vi7ro-iriutagenesis system 2.1 . The oligonucleotide employed to 
create the Nco I restriction site was synthesized using a 380B automatic DNA 
synthesizer (Applied Biosystems) and is has the sequence ' (SEQ ID NO. 17): 
CAACAACTGCCATGGTCACATC. : . This oligonucleotide containing the I^cq I site 
replaced the original, SAP-containing coding sequence at. SEQjID NO. 8, nts 32-53. 
:,The resulting M13mpl 8-04 derivative was designated mpNG4. : ; r . • 

_ . , In order to produce a bFGF .coding sequence in jMiuch ?the- stop codon 
.was removed, tl;»e FpFwencoding DNA was subcloned into a M1.3.phage^and-subjected 
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to site-directed mutagenesis. Plasmid pFC80 is a derivative of pDS20 {see, e.g., 
Duester et al. (1982) Cell ia:855-864; see also U.S. Patent Nos. 4,914,027, 5,037,744, 
5,100,784, and 5,187,261; see, also, PCT International Application No. WO 90/02800; 
and European Patent Application No. EP 267703 Al), which is almost the same as 
5 plasmid pKGlSOO (see, Bemardi et al. (1990) DNA Sequence 7:147-150; see, also 
McKenney et al. (1981) pp. 383-415 in Gene Amplification and Analysis 2: Analysis of 
Nucleic Acids by Enzymatic Methods. Chirikjian et aL, eds. North Holland Publishing 
Company, Amsterdam) except that it contains an extra 440 bp at the distal end of galK 
between nucleotides 2440 and 2880 in pDS20. Plasmid pKG1800 includes the 2880 bp 
10 Eco R I» Pvu II of pBR322 that contains the ampicillin resistance gene and an origin of 
replication. 

Plasmid prCSO was prepared from pDS20 by replacing the entire galK 
gene with the FGF-encoding DNA of SEQ ID NO. 34, inserting the trp promoter (SEQ 
ID NO. 18) and the bacteriophage lambda CII ribosome binding site (SEQ. ID NO. 19; 

15 see, e.g., Schwarz et al. (1978) Nature 272. 410) upstream of and operatively linked to 
the FGF-encoding DNA. The Trp promoter can be obtained from plasmid pDR720 
(Phamiacia PL Biochemicals) or synthesized according to SEQ ID NO. 18. Plasmid 
pFC80, contains the 2880 bp £coR l-BamW I fragment of plasmid pSD20, a synthetic 
Sal hNde I fragment that encodes the, Trp promoter region (SEQ ID NO, 1 8): 

20 /' '[[ 

EcoRi : 
MTrCCCCTGTrGACMTrAATCATCGMCTAGTTAACTAGTAG6(:A^ 
and the CII ribosome binding site (SEQ ID NO. 19): 

25 Sal I . : : Nde I 

GTCGACCAAGCTTGGGCATACATrCAATCAATTGTTATCTAAGGAAATACmCATAT^ . 

The FGF-encoding DNA was removed from pFC80 by treating it as 
follows. The pFC80 plasmid was digested by Hgal and Sail, which produces a 

30 fragment containing the CII ribosome binding site linked to the FGF-encoding DNA. 
The resulting fragment was blunt ended with DNA pol I (KJenow fragment) and 
inserted into M13mpl8 that had been opened by Smal and treated with alkaline 
phosphatase for blunt-end ligation. In order to remove the stop codon, an insert in the 
ORI minus direction was mutagenized using the Amersham kit, as described above, 

35 using the following oligonucleotide (SEQ ID NO. 20): 
GCTAAGAGCGCCATGGAGA, which contains 1 nucleotide between the FGF 
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carboxy teraiinal serine codon and a Hcq I restriction site, arid' it replaced the following 
wild type FGF encoding DN A having SEQ ID NO. 21:' 

- ■•• JS'Viv • -• ..'v. GDP.AAG-AGG TGA- CCA TGG AGA. ••■ ■ "' 

'.\ •.. -.*:\>'.AU-l^s'-Ster STOP • Pro ^Trp'Arg-^'- 



i 



; 0 Liii r Xhe resulting, mutant' derivative of Wl3mp^ native stop 

codon afterrithejcarhtoxy terminal serine codon of bFGF, was designated FGFKil3. The 
; . mutagenized region of FGFM 13 contained the correct sequence (SEQ ID NO. 22). 

10 2. . Preparatidn ofHpIasniids'p^^^ that 

encode the FGF-SAP fusion protein ' ~ 

' ,a. ' Plaisniid pFS92 (also designated PZi A) 

:Plasmid FGFM13 Wcut-with a)co I a^^^^ 
; containing the Cll ribosome binding site linked to the bFGF coding sequence with, the 
. 15 .- stop codon replaced; ' ^- ' - - ■ • " ' '• 

The Ml 3mp 18 derivative mpNG4 contMnin^ the saporin' coding 
sequence was also cut with restriction enddnucle^es TVc^ and 5ac l, and the bFGF 

; coding .fragment from^FGFM13 was inserted by ligatib^^^^ DNA encoding fiision 
protein i bFGF.SAP: into the M13mpl8 deiiUtive' <6 produce ^TOpFGF^^ 

20 contains the Cll ribosome binding site linked to the FGF-SAP fusion gene. The 
sequence of the fusion gene is set forth in SEQ ID NO. 34 and indicates that the FGF 
protein carboxy tenninus aih^ -the saporin pr^^^^ teftriiniis are separated by 6 

nucleotides (SEQ ID NOs. 34;^and 35. nts 466-471) that enddde two amino acids Ala 
Met. 

' Plasmid FGF-SAP was digested with^a I and EcoR I and the resulting 
fragment containing the bFGF^SAP coding seqiienbe isolated and ligated into 
plasmid pET 11a (available from Novagen, Madison, WI; for a description of the 
i plasmids.-see U.S, Patent -No." 4,952.496; see. also Stiidier et al. (1990) Meth. Enz. 
/<S5. 60-89; Studier et al.^(I986) X Mol. Biol/ I89:lli-I30i Rosenberg et il. (1987) 
Ge»e J<5. 125-135) that' had also been treated viith EcoR \ and Xbah ' The resulting 
plasmid was designated pFS92. It was renamed PZl A. " ' . • - ■ 

- V .-y .,. Plasmid pFS92 (or PZl A) contains DNA encoding the entire basic FGF 
protein (SEQ ID NO. 34), a 2-amino acid long Connectirig peptide, and amino acids 1 to 
253. of the mature SAP protein. Plasmid pFS92 also includes the Cll ribosome binding 
35 site linked to the FGF-SAP fusion protein and the T7 promoter region from pET 1 1 a. 
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£. coll strain BL21(DE3)pLysS (Novagen, Madison WI) was 
transformed with pFS92 according to manufacturer's instructions and the methods 
described in Example 2.A^.2. 
. . b. , PlasmidPZlB • : 

5 . , Plasmid pFS92. was digested with Eco^ X\ the ends repaired by adding 

nucleoside triphosphates and Klenow DN A polymerase, and then digested with Nde I to 
release the FGF-encoding DNA without the cll ribosome binding site. This fragment 
was ligated into pET 1 la. which had been BamH I digeisted; treated to repair the ends, 
and digested with Nde I, The resulting plasmid was designated PZIB. PZIB includes 
10 the T7 transcription terminator and the pET 1 1 a ribosome binding site. 

. £. coli strain BL21(DE3) (Novagen, Madison WI) was transformed with 
PZl.B according to manufacturers instructions and the methods described in Example 
2.A.2. 

c. Plasmid PZIC 

15 Plasmid PZIC was prepared similarly to PZIB but contains a kanamycin 

resistance gene and is based on the pET 13a vector. 

d. Plasmid PZID 

Plasmid pFS92 was digested with EcoK I and Nde I to release the FGF- 
encoding DNA without the CII ribosome binding site and the ends were repaired. This 
20 fragment was ligated into pET 12a. which had been BamW I digested and treated to 
repair the ends. The resulting plasmid was designated PZID. PZID includes DNA 
encoding the OMP T secretion signal operatively linked to DNA encoding the fiision 
protein. 

E, coli strains BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and 
25 HMS174(DE3)pLysS (Novagen, Madison WI) were transformed with PZID according 
to manufacturer's instructions and the methods described in Example 2. 

EXAMPLES 

30 

Preparation Of Modified Saporin 

Saporin was modified by addition of a cysteine residue at the N- 
terminus-encoding portion of the DNA or by the addition of a cysteine at position 4 or 
35 10. The resulting saporin is then reacted with an available cysteine or sulfhydryl- 
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.reacting moiety on a targeting agent to produce conjugates that are linked via the added 
. Cys or.Met-CyS;On saporin. • ■ " < 

Modified SAP has been prepared by altering the DNA encoding the SAP 
by inserting DNA encoding Met-Cys at position -1 or by replacing the He or the Asn 
5 cpdon within 10 oi- fewer iiesid.ues of theN-terminus with Cys. The resulting DNA has 
■ pET Irla.and pET. lSb and expressed in BL21(DE3) cells. The 

resulting saporin proteins are; designated FPSl ^(saporin With Cys at -1), FPS2 (saporin 
. . . with Cy^ at position .4) and FPS3..(saporin with Cys^ at position 1 0). A- pli^mid that 
^ ,encodes FPSl and, that has been used, for ..expression of FPSl has been designated 
1 0 PZ50B. Plasmids that trncode .FPS2 and Wat have been iised for expression of FPS2 
have been designated;: PZ5ia.(pET|la.based: plasmid^ (pET15b-based 
plasmid). Plasmids that encode FPS3 and .that have-been used , for expression of FPS3 
have been designated PZ52B (pETl la-based plasmid) and PZ52E (pET ' 1 5b-based 
plasmid). . , . .- j.^. ... < . 

.15 A. . Materials and Methp,ds - t. .- ,. ■ r --- ! .-, i 
I. Bacterial strains -r-- -:" i -.-. -rr 

Novablue (Novagen, Madison,' WlVand BL2 1 (DE3) (Novagen, Madison 



WI).- 



. DNA: manipulations 1 ^^2. 

Plasmid PZIB (designated PZIBI (the M'^ at the ehd refers to thi bacterial host strain. 
... - BL21(DE3)) described in Exiampl^ 2 was lised as the' DNA template.' " 

B. Preparation of saporin with an added cysteine residue at the N-terminus 

':---^-\ 1, Primers-'-' .t .r^' (; A. - ^ ^ • _ •'■ 

(a) Primer #1 corresponding to the sense strand of saporin, 
' ' nucleotides 472-492 of SEQ ID NO, 34, incorporates a Ndel 
site and adds a cys codon 5* to the first codon of the mature 
protein (between Met and Val): 

CATATGTGTGTCACATCAATCACATTAGAT (SEQ ID NO. 15). 

^® (*^) Primer #2 - Antisense primer complements the coding 

sequence of saporin spanning nucleotides 547-567 of SEQ ID 
NO. 34 and contains a BamHl site: 

CAGGTTTGGATCCrrTACGTT (SEQ ID NO. .16) . 
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2. Isolation of saporin^encoding DNA 

PZIB DNA was amplified as follows using the above primers. PZIB 
DNA (1 \x\) was mixed in a final volume of 100 ji! containing lOmM Tris-HCl 
(pH 8.3), 50 mM KGl, 0,01% gelatin, 2 mM MgCl2, 0.2 mM dNTPs. 0.8 ng of each 
5 primer. Next, 2.5 U Taql DNA polymerase (Boehringer Mannheim) was added and the 
mixture was overlaid with 30 pil of mineral oil (Sigma). Incubations were done in a 
DNA Thermal Cycler (Ericomp). One cycle included a denaturation step (94 C for 
1 min). an annealing step (60 C for 2 min,). and an elongation step (72^ C for 3 min). 
After 35 cycles, a 10 ^il aliquot of each reaction was mn on a 1.5% agarose gel to verify 

10 the correct size of the amplified product. 

The amplified DNA was gel purified and digested with Ndel and BamHl 
and subcloned into AWel and JS^mHI -digested pZ13. This digestion and subcloning 
step removed the FGF-^encoding DNA and 5' portion of SAP up to the BamHl site at 
nucleotides 555-560 (SEQ ID NO. 34) and replaced this portion with DNA encoding a 

15 saporin molecule that contains a cysteine residue at position -1 relative to the start site 
of the native mature SAP protein (see, SEQ ID NO. 58). The resulting plasmid is 
designated pZ50B. 

C. Preparation of saporin with a cysteine residue at position 4 or 10 of the 
native protein - 

20 „ These constructs were designed to introduce a cysteine residue at 

, position 4 or 1 0 of the native protein by replacing the isoleucine residue at position 4 or 
the asparagine residue at position 10 with cysteine. 

1. Materials 

(a) Bacterial strains 

25 The bacterial strains were Novablue and BL21(DE3) fNovagen. 

Madison. WI). 

(b) DNA manipulations 

DNA manipulations as described above. • 

2. Preparation of modified SAP-encoding DNA 

30 SAP was amplified by polymerase chain reaction (PCR) from the 

parental plasmid pZl B encoding the FGF-SAP fusion protein. 
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(a) Primers 

■ The primer corresponding to the seiise strand of saporin, 

: . i ' : ;^PI»nnm^ ID NO. 34, 

,/ ! / . :s W9on?o«:»tes a N^fl site and repl^m the He codon with a Cys 
codoh at po^^^^ 

■'■■"'CATAfGGTCXCAt^^ 

, .,. . , .,(?)...„- T^%PPV'*^V, cprre&ponding to the, sense , strand of saporin, 
niicieotides 466-515 of iSEQ ID NO. 34, incorporates a Ndel 
• ■ ' • • ' site and repiaceis the Asn codbn with a cys codon at position 
I . . < , '10 of the mature prbteiii ([SEQIDNO.60) " ' 

CATATGGTCACATCAATCACATTAGATGT^TATGTGGGACCGCGGGTCA. 

' . (3)" - Primer' #2'''- Avti's^ns^^ 'pnmer complements the coding 
Sequence of saporin^^ ispafnnibg- nucie^^ 

NO- 34 and contains a'BamHr^^^^ 

15. ' CAGGTnGGAXCCTITACGTT, . . v. ■ „. ■ j ;, ■. ; • o , ■,: , u. • ■ - . 

; . , . ,,.Amplificatibn i \ .r.:' ■: ..: 

' The nucleic acid' amplification, reactions' wete 'performed as described 
above, using the following cycles: denaturation step 94 °C for 1 min, annealing for 2 
i f^:^OT^^<^fx.t^"?t^on,for 2 min,at 72'X for35,cy.cles.^. 0^^ DNA was 

20 gel purified, digested with Ndel and Bamlll, and subcl<ftied iHt(V Ndel and EamHI 
.....digested pZlB. This digestion -remdvid the FGF aitfd ^S' pbition of SAP (up to the 
: EamHI. site) from, the parental FGF-S AP vectoY (pZ IB) and 'r^filace<l thi's pbrtibn with a 
SAP molecule containing a CYS at pdsitibn +v4 br +10; relative to the start site of the 
native mature SAP protein (see SEQ ID NOs. 36 and 37,: respectively). The resulting 
25 plasmids are designated pZ5 1 B and pZ52B, respectively., \ > 
D.. Cloningjof DNA encoding SAP mutants in vector pETlSb 
1. The SAP-Cys-1 mutants 

The initial step in this construction was the mutagenesis of the internal 
BamHl site at nucleotides 555,-560, (SEQ ID NO. 34) in pZlB using a sense primer 
30 corresponding to nucleotides , 543-570 (SEQ ID ; NO. . 34) . but changing the G at 
, . . nucleppde 555 (the. third, position in the Lys codon)-tp an A.-^.The complement of the 
sense primer was , .used as . the ami sense primer:. 5' 
TTTCAGGTTTGGATCTTTTACGTTGTTT 3' (SEQ ID NO. 61). The first round of 
amplification used amplification reactions, conducted as in B above, with primers 
35 having SEQ ID NOs. 1 5 and 61 (set forth above) and primers having SEQ ID NOs. 62 
and 24 as follows: 
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5' AAACAACGTAAAAGATCCAAACCTGAAA 3' (SEQ ID NO, 62) 
5' GGATCCGCCTCGTTTGACTACTT 3* (SEQ ID NO. 24) . 

Individual fragments were gel purified and a second round of amplification was 
5 performed using primers of SEQ ID Nos. 15 and 24, performed as in B., above. This 
amplification introduced a Nde l site and a Cys codon onto the 5* end of the saporin- 
encoding DNA. The antisense primer was complementar>' to the 3' end of the saporin 
protein and encoded a BamHI site for cloning and a stop codon (SEQ ID NO. 24): 

The resulting fragment was digested with Ndel/BamUl and inserted into 
10 pETlSb (Novagen, Madison, WI), which has a His-TagTM leader sequence (SEQ ID 
NO. 23), that had also been digested vnth Ndel/BamHL The sequence of SAP-Cys-1 is 
set forth in SEQ ID NO. 58). 

2. The SAP-Cys+4 and Sap-Cys+10 mutants 

This construction was performed similarly to the SAP-Cys-1 using pZl B 

15 as the starting material, and splice overlap extension (SOE) using PZIB as the starling 
plasmid, including mutagenesis of the internal Bam Hl site at nucleotides 555-560 (SEQ 
ID NO 34) in pZ IB using a sense primer corresponding to nucleotides 543-570 (SEQ 

^ ID NO- 34) but changing the G at nucleotide 555 (the third position in the Lys codon) to 
an A and introductf on of the cys at position 4 or 1 0 in place of the native amino acid. 

20 The first round of amplification used primers of SEQ ID NOs. 59 and 61 (for the 'cys+4 
saporin mutants) or SEQ ID NOs. 60 and 61 for the cys-flO saporin mutants): 
CATATGGTCACATCATGTACATTAGATCTAGTAAAT (SEQ ID NO. 59); and 
CATATGGTCACATCAATCACATTAGATCTAGTATGTCCGACCGCGGGTCA 
(SEQ ID NO. 60); TTTCAGGTTTGGATCTTTTACGTTGTTT (SEQ ID NO. 61 ). For 

25 each construction, the second rotmd of amplification included the Augment prepared in 
D. 1 above, using primers having SEQ ID NOs. 62 and 24. 

Amplification conditions were as follows: denaturatioh for I min at 94^ 
C, annealing for 2 min at 70^ C and extension for 2 min at 72° C for 35 cycles. 
Individual fragments were gel purified and subjected to a second round of 

30 amplification^ following the same protocol, using only the external oligonucleotides of 
SEQ ID NO. 24 and SEQ ID NO. 59 for the cys+4 mutant or SEQ ID NOs. 60 and 24 
for the cys+10 mutant. The resulting Augments had a Ndel site on the 5* end of the 
saporin-encoding DNA and a BamHI site for cloning and a stop codon on the 3' end. 
The resulting fragment was digested with Ndel/BamHl and inserted into pET 15b 

35 (Novagen, Madison, WI), which has a His-Tag™ leader sequence (SEQ ID NO. 23), 
that had also been digested Ndel/fiamHI. 
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DNA encoding unmodified. SAP (EXAMPLE 1) can be similarly 
inserted into a pET15b or pETl 1 A and expressed as described below for the modified 
SAP-encoding DNA. 

' ^ ;i ., lJ^r l(?|E3): cdls were ..trarisfoiroed with the resulting plasmids and 

9ultured ^.desci^t^edin, Example /I ; except that all incubations were condJcted at 30° C 
. <r?^ ^- 9':ieily,.a sipgle colony was 'grown in LB AMP] oo to an OD600 of 

l.P-1.5_^d then induced .^with lPTG (final. cohcentratibn 0.2 mM)-for 2 h. The bacteria 
were spimdown.- ;.■ ' i..-f ...,- . j , . 

10 F. , Purification of m"odiOedsa|ionn^^ ^^^^^^ ■ ' . ; . 

. .. v:,r. vLysis:,:lSUffet, (20 mM' .NaP04: 'pH 7.0, 5 ink^ EDtA. 5 mM EGTA. 
1 mM D ri , 0.5 ng/ml leupeptin, 1 ^g/ml aprotinin. 0.7 ^g/'^nl pepstatin) was added to 
the rSAP cell paste (produced from pZSOB'iii BL21(bE3) cells! as described above) in 
a ratio of L5 ml bpffer/g, cells. .: This mixture was evenly susjiended via a Polytron 
15 homogeni^rand.passed\throughamicrofluidizertW ' 

; . ' : V ^. i ^The resulting lysate. was, centfifuged af -50:000 rpm for 45 rriiii. The 
, - supernatant; ^as diluted.,with^S^ A (20 mM NaP04,; V mM EDT A. pH 't.O) so 

■ i::^^ :^=9nduciiyity -was<be!pw:2=5..mS/cm; The'"dilut^^ u4's then 

°"^9 .^. SPiSephar0se cplumm andiaJjrieai^:grSi&i^Ht-6f 0 to '30^^^ |3 ' 

2Pv, (l.,M.Naei.;2Q.mMcNaP04,> 1 m^ EDTA- i5H 7.^ in' Buffer 'X '^ tbtal of 6 
. column , volumes , was applied.^ FrSetidris bbiitaihmg' VsAP we^^^^ the 
, resulting rSAP had a purity of greater than "W*. ' ' ■ ' ' ' ' ' " " ' ' ' ' ' 

. ; ) : ■ Ar buffer^ exchaihgie- step • was us^d tt/"get the eluaie into: a buffer 

^ ' containing .50 nfiM'NaB03, 1 hftMfEDT A; pH 8.5 (S Buffer A).' This sample was then 
25 appHed taa-Resourcer^S column (Phannacia/Swed^ with S Buffer A. 

Pure rSAP was eluted afftfiie fcdlimui b/io cdlumn volumes of ^linear gradient of 0 to 
300mM.NaCr:in SP Buffer A! The final f SAP was approximately 98% pure and the 
; overall, yield' of I'SAP was abbut-'50»/o (the amount of ^AP in cnide; lysate was 
' determined by ELISA);'- - • ' ' ' ,.>..-. - w 

30 J : ;: v ln''this:^preparation; tiltrac used to clarift'.the jysate: 

other methods/sdch as: filtration 'and using flociiients also can' be used, in addition. 
Streamline S (PHARM'ACl aI Sweden) may also be used for large scale preparations. 
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EXAMPLE 4 

Construction Of Plasmids Encoding HBEGF-SAP Fusion Proteins 

A. Materials 

S 1. Bacterial Strains and Plasmids 

E coli strains BL21(DE3), BL21(DE3)pLysS. HMS174(DE3) and 
HMSl 74(DE3)pLysS were purchased from Novagen. Madison. Wl. 
2* DNA Manipulations 

The restriction and modification enz>'mes employed here are 
10 commercially available in the U.S. Minipreparation and maxipreparations of plasmids. 
preparation of competent cells, transformation. Ml 3 manipulation, bacterial media and 
Western blotting were performed using routine methods {see. e.g., Sambrook el al. 
(1989) Molecular Cloning: A Laborator\' Manual. Cold Spring Harbor Laborator> 
Press. Cold Spring Harbon NY). The purification of DNA fragments was done using 
15 the Geneclean II kit, purchased from Bio 101, SDS gel electrophoresis was performed 
on a Phastsystem (Pharmacia). 

B. Removal of FGF Sequences from PZIBI 

The plasmid PZIBI contains DNA encoding FGF linked to DNA 
encoding saporin via a spacer region encoding two amino acids (Ala-Mel). The fusion 

20 gene is cloned into the Nde\ and BamVil sites of the plasmid vector pETl la (Novagen). 
The vector provides a T71ac promoter, a lac operator, and a ribosome binding site 
upstream of the fusion gene, and a T7 terminator downstream of the fusion gene. 

To remove FGF sequences. PZIBI was digested with the restriction 
enzymes hIdeX and Ncol. Ndel cuts once within the plasmid at a position encompassing 

25 the translation initiation codon (ATG) for FGF. Ncol also cuts once within the plasmid 
at a site within the two amino acid linker region of the plasmid PZIBI. Digestion of 
PZIBI with Ndel and Ncol thus generates two fragments: a FGF-fragment and a 
fragment containing vector (pETlla) and saporin-encoding sequence. The digestion 
products were resolved in an agarose gel and the vector/saporin fragment, was purified 

30 using the Geneclean II kit. 

€• Amplification and isolation of DNA encoding mature HBEGF 

The plasmid pJMU2-l (see. gift^ from. Dr. J. Abraham: see. also 
International PCT Application WO 92/06705. which is based on U.S. Application Serial 
No. 08/598.082) contains a 2.36 kb human HBEGF cDNA fragment (with added £coRl 

35 linkers) cloned into the £coRl site of pUC9 (see. e.g.. Viera el al. (1982) Gene 79.259- 
2678: see. also GB 2045254 A: available from numerous commercial sources, such as 
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Phannaca Fte Chemicals. Piscaaway. NJ). This fragmen, encodes a 208 amino acid 
p™ fon. of HBEGF (SEQ ID NO. 1). Any plasmid iha, contains .he cDNA 
?RPr? ^" '™f '«^'^'P""« (c.g.,.pM™™EGF. ATCC *40900 and pAX- 
HBEOF. ATCC M0899) may also be used for HBEGF amplification as described 
J nerem. 

HRFrp ; ' '^.^Bion <>f,ihc UNAti^ encodes a. 77. amino acid fonn of mature 
HBEGF corresponding ,o nuc:leotides.;i 7-44X.or.amino acids.73-l;49 in the precursor) 
was ampl^.ed. The primer (SEQ ID NO. 25) corresponding, to the "sense" strand of 
/"^'r --J"des an Nd£l i^striction site adaptor (and a Met codon) upstream of 

10 the codon for ammo acid 73 of precursor HBEGF.(SEQ IT? No. .1) and spans the first 14 
nudeotides of the DNA sequence encoding ^m^^^^^ H.PEGF (SEQ ID NQ. 3): . . 

. . , -. ' , . . .''.^'-CTGCACCATATGAGAGTCAC^^ (SEQ ID .n6."..25)! 

„nr.^J ^ ^ > The primer (SEQ lb IsJO.;26) cprresp9ndh;g to th^"anti sense" strand of 
HBEGF complements 23 nucleotides encoding amino acids 144.149 of the precursor 
dZIsEQ^T^^ ^-^^?-^'on site down,tream:pf:tb^^ 

\ ' . ^...^PH^.%^«y rwi^tioii/Wi^oinicxI tn which plasmid pJMU2-l 
DNA (200 ng) was mixed in a final volume of 100 m1 containing 10 mM Tris-HCl (pH 
8 50 mM KCl. 1.5 mM MgCl2, 200 mM dNTPs; 1.00 pmoje of each primer, and 2 5 
.5 UTaqI polymerase (Bbehringer Mam^heim). . Amplifications were done in a TwinBlock 
DNA thermal cycler (Ericomp). Tlie first cycle was a denaturation step (94-C for 5 
mm): The second cycle was repeated 30 times and included a .deriatu^t ion step an 
■ anneahng step, and an elongation step (94»C for 1 min; 62°C for 2 min- 7-' »C for -> 
^ min). The third cycle was an elongation step (72 «>C for 7 min). An aliquot of the 
.0 reaction was run on an agarose gel and the amplified product wa^ purified using the 
Geneclean II kit (Bio 101). 

D. Preparation of plasimids that encode the HBEGF-SAP fusion protein 

' • ; - ' The purified amplified product encoding HBEGF was then digested with 
Nde\ arid /?cal (Which generates an end compatible with Arol) and ligated into the 
.5- NdeVNcol sites of PZ 1 Bl. Following transformation into the E. Coli strain NovaBlue 
; (Novagen) positive clones were identified by restriction; eniymp digestion of miniprep 
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DNAs. A positive clone designated pZ31Bl was sequenced starting within the vector 
sequence (using the T7 promoter primer) and extending through the HBEGF-coding 
sequence, the Val-Met two amino acid linker (generated by ligation at the Real and 
Ncol sites), and into the saporin sequence. The positive pZ31Bl plasmid gave the 
5 proper nucleotide sequence (i.e., SEQ ID NO. 6) for the HBEGF-SAP fusion gene. The 
fusion protein encoded by the plasmid pZ31Bl contains 78 amino acids of HB-EGF 
(including a methionine introduced by the Ndel restriction site), a two amino acid 
(Val-Met) linker and 253 amino acids of saporin (SEQ ID NO. 6). The pZ3IBl 
encoded fusion protein is therefore 333 amino acids long with a predicted molecular 
1 0 weight of about 37.6 kD and an isoelectric point of 9.6. 

E. Expression of the recombinant HBEGF-SAP fusion proteins 

The two-stage method described above was used to produce recombinant 
HBEGF-SAP protein (hereinafter HBEGF-SAP fusion protein) encoded by pZ31Bl. 
Plasmid pZ31Bl was transformed into E. coli strain BL21(DE3), which contains 
15 chromosomal copies of the T7 RNA polymerase gene linked to an IPTG-inducible 
lacUV promoter. 

1. . Small scall preparation 

The transformed E. coli cells were grown in 50 ml cultures of LB broth + 
ampicillin (100 fig/ml) at 30^G to an OP600 of 0.7. :The second stage was commenced 
20 bv the addition of IPTG (0.1 m.M) to induce expression of the T7 RNA polymerase 
gene. Cultures were continued at 30^ C. Qne ml aliquots of the culture were removed 
just prior to IPTG addition and then hourly thereafter. Aliquots were centrifuged. 
resuspended in 1 ml lysis buffer (10 mM Tris pH 8.0. 2 mM EDTA. 0.01 mg/ml 
lysozyme, 10 mM DTT> and incubated for 1 hour at room temperature. Following 
25 centrifiigation, the lysed supematants were analyzed by Western blotting (using an anti- 
SAP antibody) for expression of the fusion protein. SDS gel electrophoresis was 
performed on a Phastsystem utilizing 10-15% gradient gels (Pharmacia). Western 
blotting was accomplished by transfer of the electrophoresed protein to nitrocellulose 
using the PhastTransfer system (Pharmacia), as described by the manufacturer. Anti- 
30 SAP, antibodies are used at a dilution of 1:1000. Horseradish peroxidase labeled 
anti-IgG was used as the second antibody (Davis et al. (1986) Basic Methods in 
Molecular Biologjs New York. Elsevier Science Publishing Co.. pp 1-338). The 
Western blot analysis demonstrated induction of a soluble protein with the predicted 
molecular weight. 
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of the bacterial lysates were also analyzed by an ELISA assay 
(using an anti-SAP antibody). The results of- this assay' cohfiniied the induced 
expression of HBEGF-SAP. • ' :■ ■ " 

:. •.■.'.-;-2-;; i >- 'Large-scale .preparation '' ■-- 

^ ' • '"^ ciilture were grown as described 

- above, in' a-felFmentation apparatus 'exeepf that carbd^icillin (100 jig/mi) was 

sub^ituted for ampid«inl The pelleted cii^ 
' ' F. ' Purification of HBEGF-SAP fusion - i u . 

Aliquots of the feraientktion culture p^te were removed from the 
1 0 freezer, resuspended in Bu^eV A (1 0 rnM NaCitrate -pH 6.0, 1 0 mM EDTA. 1 0 mM 
EGTA. 50 mM NaCl); amd lyged With'ai^ l^^^^ (Model' l iOY. Microfluidics 

Gorp.). The lysate was cehtHfu^ed at -lOO.OC^^ x § and the resulting supernatant was 
c loaded onto a S-Streamline"cdlumn (Phkr^acia) eq^^ A. The 

column was Washed- with ^buffer B (10 mM Nk-Phosphater5 mlCl EEiTA^ 5 mM EGTA 
15 at pH 8.0) containing 0.2 M NaGl ilntif the A280 of th4 eluate ^4'ciied bakeline. The 
HBEGF fusion protein was eluted with buffer B containing 0.8 M Nadl! '■ 

Fractions were analyzed 'foV presenile' of tttS fusion protein by SDS 
; PAGE and Western blotting. The HBEGF-cbntafnirtg fraction^ ^were pooled, diluted 4x 
'. .with, buffer B and -applied to 'a QtSej)Haf6se (Phdmktiaf ^^t^^ with '"" 

20- -buffer B. The flow through was applied di'rkfctly to' a SP^SeijhafBs'e- 1^^^^^^^^ caUon 
exchange column (Pharmaci'a) equiliWed'with buffSf B contaihirtg 6J^ M NaGl. The ' 
HBEGF fusion , protein AVas ^ eluted with a 0.2O .0 M ' NaiCr 'gradient. Fractions 
containing fusion protein'(asi deterinibed by SDS PAGE) were pooled arid loaded onto a 
^ :heparin-Sepharose GL6B (Pharmacia) column equilibrated with buffer G' (10 mM 
25 :NaCitrate-pH 6:0,.l mM EDTA.-l mM EGTA, 0.2 M NaGl). The' fusion protein was 
. . eluted with a.0.2-J .2^M NaGl gradient. Fractions containing fusion protein were pooled 
'. and NH4S04 was:'added to 2.0 M.- Following filtratibn. the material was applied to a 
' - Phenyl-.Sepharose HP column equilibrated with buffer C containing 2 M NH4SO4. The 
fusion protein wks eluted with a 2.0-0.0 M •NH4S64'gradiehtf 'Fractions containing 
30 fusion protein were pooled and applied to" a'SlOO size exclusion column eqiiilibrated 
, • with/H) m.M NaCitrate (pH 6.0). 0.1 M EDTA, 0:14 M NaGli'^ Fractions containing 
purified HBEGF fusion protein were then selected. ' " - ' 
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G. Characterization of the HBEGF-SAP fusion protein 

1. Effect of HBEGF-SAP fusion protein on cell-free protein synthesis 
The RIP activity of HBEGF fusion protein .ericoded by pZ31Bl was 

assayed as described in Example 1 for s^orin. The results indicated that the IC50 of 
5 the HBEGF-SAP fusion protein exhibits activity in this assay. » 

2. Cytotoxicity of HBEGFrSAP fusion protein 

Cytotoxicity experiments, are: performed with the Promfega (Madison. 
WI) CellTiter96 Cell Proliferarion/Gytotoxieity Assay. About 1,500 A431 cells 
(ATCC Accession No. CRL 1555), an epidermoid carcinoma cell line, are plated per 

10 well in a 96 well plate in 90 ^il HDMEM plus 10% FCS and incubated overnight at 3T 
C. 5% CO2. The following morning 10 (il of media alone or 10 ^l of media containing 
various coiiccMtratiuns of the fusion protein, HBEGF polypeptide or saporin are added 
to the wells. The plate is incubated for 72 hours at 37 C. Following the incubation 
period, the number of living cells is determined by measuring the incorporation and 

15 conversion of the commonly available dye MTT supplied as a part of the Promega kit. 
Fifteen \x\ of the MTT solution is added to each well, and incubation is continued for 
approximately 4 hours. Next, 100 of the standard solubilization solution supplied as 
a part of the Prom-ga kit is added to each well. The plate is allowed to stand overnight 

^ at room temperature and the absorbance at 560 nm is read on an ELISA plate reader 

20 (e.g., Titertefc Multiskan PLUS. ICN, Flow, Costa Mesa, CA). 

.- <- * ■ 

EXAMPLES 

25 Chemical Synthesis of HBEGF-SAP 

About 50-100 nmol of HBEGF that has been dialyzed against phosphate* 
buffered saline is added to about 2.5 mg mono-derivatized SAP (a 1.5 molar excess 
over the HBEGF polypeptide) and left on a rocker platform overnight. The ultraviolet- 

30 visible wavelength spectrum is checked in order to determine the extent of reaction by 
the release of pyridylthione, which adsorbs at 343 nm with a known extinction 
coefficient- The reaction mixtures are treated for purification in th^e following manner: 
reaction mixture is passed over a HiTrap heparin-Sepharose column (1 rhl) equilibrated 
with 0.15 M sodium chloride in buffer A at a flow rate of 0.5 ml/min. The column is 

35 washed with 0.6 M NaCl and 1 .0 M NaCl in buffer A and the product eluted with 4.0 M 
NaCI in buffer A. Fractions (0.5 ml) are Analyzed by gel electrophoresis and 
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absorbance at 280 nm. Peak tubes are pooled and dialyzed versus 10 mM sodium 
phosphate, pH 7.5 and applied to a Mono S 5/5 column equilibrated with the same 
.:buffer.. A 10 ml gradient between 0 and 1.0 M sodium chloride in equilibration buffer is 
. ; used toielute the product.7 . . • . 

5 Cytotoxicity of HBEGF.SAP i :- . ; - 

Cytotoxicity to several eel Itypes} ^uch as A-431 cells (ATCC Accession 
. ,,No..CRL J555) qr.other.sm6oth muscle cells is tested usirig-the Promega (Madison. 
, WI) CellTjt^r,?,6.Cell Proliferaticn/Gytotoxicity Assay described above in Example 4. 
The. HBEGF,SAP conjugate should be cytotoxic to each cell type that expresses an 
10. .p.GF receptor.;-. ■ , . ■"■i " : •.• ' 



15 



Construction of Plaskiids for Insertiot^ of LrfsTKERs 



20 



A. Construction of pIasmid PZ32Bl containing linker^ameriable HBEGF-SAP 
- : within the codiiig region of mature HBEGF 

; Plasmid.plasmid pJMU2U., : as described! in Exampl^^^^-4^^^^^ as the 

amplification template fo.ii. preparatin of .the linker-amfenablfe- HfiEGF-SAP plasniid 
P232B1. Each of the two Ncol sites contained within the region encoding mature 
HBEGF was mutated in separate amplification reactions. First, a "sense" primer was 
constructed that corresponds tq the nucleotides encoding amino acids 13-19 (SEQ ID 
NO. 1 . nucleotides 37-57 in the HBEGF precursor and includes a EslI site (SEO ID NO 

;-.5-.-CT6QCTGCAGTTCTCTCG6CA-3'.-: <•• ' ■ • 

30 An "antisense" primer spanning the nucleotides that encode amino acids ' 11 4 tb^ 129 in 
the HBEGF precursor %yas designed that imroduces a single base mutatioii (T-*C in the 
sense strand) that destroys an McqI site, while maintaining- a codon for. the amino acid 
histidine at position 1 18 (SEpjD.^JO. 29): , . . . , . - 

! ... .,, \.5j:AGCCCGGAS^ . 
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The antisense primer also spans a Sad site. Amplification of the DNA encoding 
HBEGF using these two primers generates a fragment wth a Psfl site at its 5' end and a 
Sad site at the 3' end. The amplification was. carried out under the same conditions 
previously described in Example 4.C.. except that the second cycle was repeated only 
5 20 times. 

The second Ncol site within the mature HBEGF coding sequence was 
mutated using a "sense" primer that spans the nucleotides encoding amino acids 
124-142 in the HBEGF precursor (SEQ ID NO. 30): 

10 Saci 

5 - GTGAAGGAGCTCCGGGCTCCCTCCTGCATCTGCC ACCCGG6TTATC ATGGAGAG AGG - 3 ' 

This sense primer includes a Sad site, and introduces a singie base mutation (C— >T) 
that destroys the Atol site while maintaining a codon for the amino acid tyrosine at 
1 5 position 1 38 of SEQ ID NO. 1 . 

An antisense primer that spans a region of the HBEGF-encoding DNA 
downstream of the precursor HBEGF coding region was designed to introduce an 
£coRI site adaptor at the 3* end of the amplified DNA fragment (SEQ ID NO. 3 1 ): 

20 . ££QEI - 

. - . . 5NATATAGAATrGTGTCTTCTCA6AGGTA-3' . 

Amplification of HBEGF-encoding DN A using these two primers generates a fragment 
with a Sad site at its 5* end and an EcoRl site at the 3* end. 

25 The amplified HBEGF fragments generated by the two above 

amplification reactions overlap at a Sad site. Following purification using the 
Geneclean II kit (Bio 101), the first product was digested with P^rl and Sad and the 
second product was digested with Sad and EcoRl. The digested fragments were ligated 
into the Pstl and £coRI sites of the vector pGEM-4 (the pGEM series of plasmids are 

30 available from Promega, Madison WI; see also. U.S. Patent No. 4.766.072. which 
describes construction of the pGEM plasmids) producing the plasmid pGEM/HBEGF. 
This plasmid contains a regenerated colinear piece of DNA encompassing the entire 
mature HBEGF coding region (see. e,g., nucleotides 1-234 of SEQ ID No. 33). 

Using this pGEM/HBEGF plasmid as a template^ the mature HBEGF 

35 encoding region was amplified using the primers set forth in SEQ ID NO. 25 that 
corresponds to th^ "sense" strand of mature HBEGF including an Ndel restriction site 
adaptor just upstream of the codon for amino acid 73 of precursor HBEGF. The other 
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" primer corresponds to the "antisense" strand of HBEGF spanning nucleotides encoding 
amino acids 143-149 of precursor HBEGF (see, e.g. . SEQ. ID NO. 1 ), and introduces an 
Ala-Met-lNl5a2l restriction site just downstream of mature HBEGF-ehcbding DNA (SEQ 

- • 'IDNO/'32):-'^'^-^ i^i: . - \ Si ■ /v..: 

••'^ " ^5"- 'ATATAeCATGGGTGGGAGGGTGAGeeCATC^^ - '^ ~ • 

Amplification of HBEGF sequences using these two primers and the above 
10 pGEM/HBEGF plasmid as a template, generates a mature HBEGF encoding fragment 
with an tidfilcsite at- ihe 5' end and a unique A'col site at the 3- end;- An aliquot of the 
amplification reaction was run on an agarose gel and the amplified product was purified 
using the Geneclean II kit? - The purified DNA Was then digested with' WJel and Nco\ 
and ligaied into the NdeI/N6oI; sites of pZlBl (digested to remove 
15 as described in Example 4.B.). * ^ ' ' • 

V Following .transfcirmation into the £. co/7 strain NovaBlue (Novagen) 

..'i positive clones were identified by restriction enz\TOe digestion of rhiniprep E>NA. A 
{positive '^clone designated pZ32Bt plasmid ^ was ^sequerife^ Within^ the vector 

sequence (using the T7 promoter primer) and extending through the HBEGF-coding 
20 sequence, the two amino acid Ala-Met linker and intoi th^ saporin sequence. The 
plasmid pZ32Bl gave the desired sequence for ihe- H^ gene set forth 

in SEQ ID NO. 33. The fusion protein encoded by the plasmid pZ32Bl includes 78 
amino acids of HBEGF (including a methibhine introduced by the Ndel restriction site), 
a two amino acid (Ala^lVietj nhicfeVland 253 iamino acid^ of sapori^ (SEQ l£) NO. 33). 
25 The fusion protein is therefore' 333 amino acids long With' a' predicted molecular weight 
of about 37.6 kD arid an isoelecfric point of 9.6. i* - ' " = 
' ' ' The resulting Hhker-amenable HBE(jF-SAP plasmid ^p 
' from the HBEGF-SAP encoding plasmid PZ31B1 described in Exarnple 4:D. in the 
"'following ways: " ' • • '.^ • / ^ . »-•• -.;} 

30' • 1) ' ■ TwoWcol sites within the coding region for mature HBEGF have 

• ' been 'mutated (by aniplification) so that the A'cof sites are destroyed without changing 
the reading frame or amino acid composition of HBEGF. " •* * \ ' 

' • " 2) ' The two amino acid linker betu'een HBEGF aiid SAP is Ala-Met 
in the new plasmid cohstrufct pZ32B l . This Ala-Met linker encompasses an Nco\ site 
'35 (theonly ^i^cbl site in the newpl&mid> Therefore, the resulting HBEGF-SAP plasmid 
can be liheariied -by digeslibn with Mrol^ bifferent linkers, w^hich have A^col sites at 
' their ehds/cari then Be inserted between '1 
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The desired linker is then inserted into plasmid pZ32Bl. The resulting 
plasmid is introduced into £. coli host cells, expressed and the ftasion proteins isolated 
as described in Example 4.F. Fusion proteins may also be isolated using the same 
procedures as those described for HBEGF (see. e.g.. International PCT Application WO 
5 92/06705. which is based on U.S. Application Serial No. 08/598.082). 
B. Preparation of a PETSAP-MCS (MCS^multiple cloning sites) 

A SAP cassette plasmid (PETSAP-MCS) Was made that would be 
amenable to insertion of any growth factor sequence downstream from the saporin- 
encoding DNA. 

'0 SAP encoding DNA was amplified (using PZ32B1 as a template) to give 

SEQipN0. 81. The sense and antisense primers respectively, used to amplify this 
SAP fragineni were; - . . ■ 

SEQ ID NO. 54: -TGAGCGMIKCMAieGTCACATCAATCACATrA 
15 ££qRI 

SEQ ID NO. 55: 5 ' TATATGMEnXMGfiCCrrTGGriTGCCCAAATACAT 

ECfiRl tlcQl 

20 The resulting SAP-encoding DNA has an £coRI site at its 5* end followed by an Nde\ 
site that encompasses the ATG .codon. The, 3' end of the SAP fragment has no stop 
codon. and has an A^col site followed by art £coRI restriction site. 

The amplified SAP fragment u-as then digested with EcoKl and 
subcloned into the £coRJ site of the plasmid pGEM-4 (pGEM-4 serves as the source of 

25 the MCS. the pGEM series of plasmids are available from Promega, Madison WI; see 
also. U.S. Patent No. 4.766,072, which describes construction of the pGEM plasmids) 
in such an orientation that the multicloning site (MCS) of pGEM-4 lies downstream (3' 
oO from the SAP -encoding DNA. 

Plasmid pGEMSAP was digested with Pst\ and the ends of the fragment 

30 were blunt-ended. The fragment was then digested with Ndel. thereby generating a 
fragment that contains all of the saporin-encoding DNA and most of the MCS of 
pGEM-4. This fragment was then cloned into the NdeMBamHl sites of pET II a. in 
which the Bam\\\ site had been blunt-ended by filling in with Klenow pohTnerase. The 
resulting plasmid was designated PETSAP-MCS. It has unique SacL SmaL and Sail 

35 sites in the MCS for insertion of DNA encoding a desired linker. HBEGF. or 
combination of HBEGF and linker downstream from (3' oO the saporin-encoding DNA. 
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■• 'EXAMPLE?- • 

. .: V V T 'PR^ARATION OF HBEGF GofliticATESTHATCONTAIN LiNKERs'' ' 

A. Synthesis . ' of r oligonucleotides encoding protease substrates and 
oiigonucleqtidp encoding. flexible linkers . ,. 

. ^ Complementary, single-stranded oligos. in ^^^^^^ 
a protease substrate or flexible linker, have been synthesized either using ^ Cvclone 
10 machine (MILLIPORE. Bedford, I^A).. according the- instructions provided by the 
manufacturpr or, if greater than 80 bases, were made by Midland Certified Reagent Co 
(MIDLAND. TX). The following oligos have been synthesized ahd 'can be introduced 
into constructs encoding HBEGF-SAP or SAP-HBEGF. 

J- Cathepsin B substratelinker: 
5'- CCAT6GCCCTGGCCCTGGCCCTGGCCCTGGCCATGG S£Q ID NO. 38 

2. . .Cathepsin D substrate linker , i, • ' ^ 
5' - CCATGGGCCGATCGGGCTTCCTGGGCTTCGGGTTCCIGG 
GCTTCGCCAT GG -3" SEQ ID NO. 39 
.20,. . , , 3.. . Trypsin substrate linker • • «•■- -^-'Vc . .- ^ • -i ' 

... . 5'-. CCATGGGCCGATGGGGCGGTGGGTGCGGTGGTAATAGV\GT • •• ■•^^ V 

CAGAAGATCAGTGGGAAGCAGCCTGXCTTGCGGTGGTCTC • ' ' ' ' • ' ' ' 
■ GACCTGCAGG-CCATGG-3* .;.-- SEQ- ID NO. '44 : 
■ • -.4: ••Gly4Ser ■,->.■■-.• : '>.:■.■ ■ ■ ' i ■ Vi- L\- ' v- 

25 -5% CCATGGGCGG .CGGGGGCTCT'GCCATGG --3^ SEQ aO-^ " < . . •• . 

:. .5:.. . (Gly4Ser)2 •. - ■ ' ^ ' • • • 

.5*-. CGATGGGGGGC.GGCGGGTCTGGCGGCGGeGGGTG • " '-■ ' - ■ ■ 

TGGCATGG -3" SEQ ID NO. 41 " • ' • • . - .•• 

• L 6r. , •: , (Ser4Gly)4 ■ -• •.. " 

30 .S'.r CGATGGGGTCGTCGTGGTGGGGGTGGTGGTCGTG ' . '«. • :.• - 

.- GGGCTCGTGGTCGTGGGGCTCGTGGTGGTGGGGG'-' ' • - ' •• 
... • GGGATGG- -.3''S.EQ. ID NO'. 42 ••' • 

. ;. . 7. , ., , (Ser4Gly)2 ■ - ■ .• • : ■ . 

• 5- GGATGGCCTGGTCGTCGTGGGGCTGGTGGTCGTC •• • - ' ' • ■ 
35 . ...GGGGGGGATGG t3' SEO I0 N0:.-43 ' - ; ' -.r'. .-' .-. . 

8. . Thrombin substrate linker ' • • ....... 
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CTG GTG CCG CGC GGC AGC SEQ ID NO. 45 

Leu Mali Pro Arg Gly Ser 

9. Enterokinase substrate linker 
GAC 6AC GAC GAC CCA SEQ ID No. 46 

5 Asp Asp Asp Asp Lys . - 

10. Factor Xa substrate 
ATC GAA GGT CGT SEQ ID No. 47 
He Glu Gly Arg 

Preparation of DNA constructs encoding HBEGF-Linker-SAP 

10 HBEGF-Ala-Met-SAP (PZ32B1) was digested with Nco l for insertion of linker 
sequences. The following linkers have been inserted: cathepsin D sensitive site, 
diphtheria toxin trypsin sensitive site, and Gly4SerGh'2SerGly4SerGly4Ser. which ma} 
enhance binding of the fusion protein to the receptor compared to fusion proteins 
lacking such linker. 

1 5 C. Preparation of SAP-Aia-Met-Ala-HBEGF 

HBEGF-encoding DNA was amplified (usin^ PZ32B1 as a template) to 
produce a Ncol site at the 5' end and a stop codon followed by a Sail site at the 3* end. 
The sense and antisense primers, respectively, used in ihe amplification reiaction were: 
SEQ ID No. 52: 5 ' -TATATGmiGGCCAGAGTCACTTTATCCTCCAAG ' 

20 

SEQ ID No. 53: 5 ' -TATATGICGAQATGGGAGGCTCAGCCCATGACA 

Sill Stop 

The resulting amplified product was digested with Nco l and Sail and ligated into 
tj£Ql/SalI digested PETSAP-MCS. The resulting plasmid (PZ36B1) encodes a protein 
25 with an Ala-Met-Ala linker between the SAP and HBEGF moieties (SEQ ID No. 82). 

D. Preparation of SAP-Ala.Met-(Gly4Ser)4-Ala-Met-Ala-HBEGF 

Plasmid PZ36B1 was digested with and a linker encoding Ala-Met- 
(Gly4Ser)4-Aia-Met-Ala was inserted. The resulting plasmid was designated PZ37B1 . 

E. Expression of conjugates with linkers ' , 

30 DNA encoding the conjugates set forth above and summarized in 

Table 3 are expressed above for PZ31B1 using plasmids prepared as described abo\'e 
and summarized in TABLE 3. 

F. Western blot analysis of HBEGF fusion proteins 

All HBEGF constructs have dernbiistrated inducible expression of 
35 proteins of the expected size when analyzed by Western blotting. Using the protocol set 
forth in Example 4. F., purification (to greater than 95%) of HBEGF-Ala-Mei- 
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30 



35 



Alltn^XT f >'^^^^>2-Ala-Met-SAP and SAP-A,a-MeKGly4Ser)x4-AIa-Met- 
Ala-HBEGF has been achieved. Specifically, when purification of HBEGF-Ala-Met- 
(Gly4SerGly2Ser)(Gly4Ser)2-Ala-Met-SAP'wks^ptin.ized;^ n^aterial 
- K?!? "^^^ tv^^pe^s. the first peak eluting at 0.6-0 8 

^ M NaCl (pool A) and the second peak eluting at 0.9-1.6 M NaCI (pool B). Pool B was 
found to contain material whose bioafctivity (on A431 cells) was. ten times more active 
than the material in Pool A. L . : ^ 

G. Biological activity ofHBEGF fusion proteins 

in P7„«,\" •'^^^'^'^'^^'^^^'^^^^^^ by plasmid 

10 . PZ3I Bi) was actiyejn the cell^free RIP assay ^ ' * 

. ...Insertion, of the. (Gly4Sel^)x4>lirtll^ 
nis.on.protem,wi.th cytotoxicity to A431 tells flDso on th^ order of 10-1« -lO'^ M) 
The purified fttsion protein. SAP-Ala-Met^Gly^Ser)x^-Ali-MeUla-HBEGF exhibits 
^ similar, perhaps somewhat higher, cytotoxicity. These two riBfeoF fusion proteins 
have also been tested for their cytotoxicity : (relative to FdF-^XP) using other cell lines 
.ncludmg aortic ^smpoth muscle.ceHs (active);igliobl^toina^^'d medulloblastoma cells 
„ (pctive), SK-MEL melanoma,celIs (somewhat actiVe);^^d^k^^^ lung carcinoma 

^'^^^^'"^^">'^>V .Therefore, there ^ Cell.tyjJe aifferehciTiri^Vf.e^ bShot^^^^ of these^ 
proteins. .... ^ ^- - --^f^^io. " % r^iT^ r c '-'^ 



EXAMPLES 

- - ; . S ^^^i-<^viRus Expression - 

^ . The following proteins have- been expressed in the baculovirus system- 
Met-Cys-HBEGF, Met-Cys-Ala-Met-Ala-HBEGF ^^nker amenable). Met-Cvs-Ala- 
,Met-(Gly4Ser)2-Ala-M^t,Ala-HBEGF,(prepared. by insertion of linker into the U^l 
site of Met-Cys-Ala-Met-Ala-HBEGF). and Met-Cys.Ala-Met-(Gly4Ser)x4-Ala-Met- 
Ala-HBEGF (prepared by insertion of linker into the Ncfil site- of Met-Cvs-Ala-Met- 
Ala-HBEGF). ' ' , , . 

A. Niodification ofHBEGF encoding DN A ; ' - 

Mature HBEGF-encoding DNA fragments were amplified (using 
PZ32B 1 as a template) to give a BamHl site at the.5' end followed bv Met-Cvs codons 
At the 3; end the amplified product had a stop codon, followed .by a //m^flll site. TTie 
p^mersused^yere(SEQIDNOs.56and57.w^spectivd^^^^^ : 
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Sense : . 5 ' - TATATGGAILCTMiiEQIAGAGTCACTTTATCCTCCAAG 

EmjH Met Cys 
Ant 1 sense : 5 " - TATATMGCIKIATGGGAGGCTCAGCCCATGACA 
5 . Hindlll STOP. . . - ; 

The amplified product was digested with BamH l and Hind lll and ligated into 
BamHl////«dIII digested pBlueBaclll (Invitrogen). The sequence of the DNA encoding 
HBEGF in this plasmid is set forth in SEQ ID NO. 83. 
1 0 B. Preparation of a linker-amenable HBEGF-baculovinis vector 

A linker-amenable HBEGF/BlueBac clone was made by amplifying 
HBEGF sequences as above using a ditt^ent sense primer (StQ lU NO. 86): 

6 ' TATAGGATCCTG ATGTGTGr.r.ATGGrr AGAGTr.Ar.TTTATrrTrrAAr^rrA 

15 " EaiDHI Met Cys. Ala Met Ala 

The resulting amplified fragment (SEQ.JD No. 84) has a Bam Hl site at the 5' end 
followed by Met-Cys- Ala-Met- Ala (SEQ ilD NO. 85) codons that encompass a Nco l 
site. .i livii: y. '.- ■■ 

20 

EXAMPLE 9 

/.V \ i\'o Assays for Monitoring the Effects of Conjugates on Smooth Muscle 

Cells 

25 ^ 

In vivo assays monitoring the effects of conjugates on smooth muscle 
cells have been described, for example, in Casscells ei al. (1992) Proc. Natl Acad, 
Sci. USA 89:71 59-71 63-. Siich assays are used herein. 

Balloon catheter .denudation is performed on the left carotid aner\' of 5-6 

30 month old male Sprague^Dawley? rats by intraluminal passage of a 2F Fogari>' balloon. 
Body weights range from 300-350 g the day prior to surgerx . At 0, 3. 6, 9 days after 
balloon injurx . wild-type chemical conjugate HBEGF-SAP (1-10 jig/kg/dose). fusion 
protein HBEGF-SAP ( 1 - 1 0 ^g/kg/dose). or vehicle (0.9%NaCl, 0.1% human serum 
albumin (HSA)) is injected. via the. tail vein. The therapeutic composition is prepared 

35 by mixing the test materials. with appropriate volumes of 0.9% NaCU 0.1% HSA. The 
wild-type , chemical . conjugate .is . supplied, in lOmM citrate, 0J4mNaCl. 
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0.1 mM EDTA, pH 6 at a concentration of l.Omg/ml prior to being prepared in 
appropriate dosages. The fusion protein is supplied in 10 mM citrate. 0.14 m NaCL 
0.1 mM EDTA, pH 6 at a concentration of 0.256 mg/ml prior to being prepared in 
appropriate dosages. On day 14 after balloon 'denudation, approximately 120 hr after 
5 the final dose, animals are sacrificed with an overdose of intravenous KCl under deep 
anesthesia. One hour before sacrifice, animals are injected intravenously with Evans 
blue dye (0:5;ml. 5% in saline) to confirm endothelial denudation. At one and 1 7 hours 
priorito.sacrifice, animals iare injected intraperitorieailly with Bromodeoxyuridine (Brdb\ 
30mg/kg) for quantitation of cellular' >r61iferation. At sacrifice, the arterial tree is 

10 perfused at 80 mm Hg with Hahk^s balanced salt' solution. 15 mM HEPES. pH 7.4 until 
the perfusate from thcijugular is' clear y^f blood: The arterial tree is then perfused with 
2% paraformaldehyde in O.i M Na G^codylater buffer. pH 7.4; fof 15 nimiiles. The 
carotid arteries are then removed, cut into sections, and processed for light microscop> . 
Tissue samples are dehydrated, and -embedded iri paraffin: cut into 4^ sections, and 

15 stained with hematoxylin-eosin and Movat pentachrome kain. Vessels are then 
measured for intimal, medial, and neointimal areas by computerised planimetr>'. Ami- 
BrdU antibody .is used for detection^ of BrdU positiv^^ smooth rnuscle cell 

proliferation is quantitated by counting BrdU positive cells as a percent of tc^tal smooth 
muscle cells. 

20 

EXAMPLE 10 

Effect of HBEGF-Containing Conjugate in Mouse Solid Tumor 
25 Xenograft Model 

j/ \ The /« v/va. mouse solid tumor xenograft model. wHich' assays for a 
compound s ability to inhibit tumor cell proliferation, has been described, in Beitz et al. 

' i}99,2 ) Canccr'Rcs: 52:227-230, For example; wild-type chemical conjugate and fusion 

..,30' : protein. HBEGF-SAP are evaluated for anti-tumor activity against any EGF-receptor 
- expressing tumor subtype, e.g./bladder carcinoma, in a mouse tumor xenograft model. 
Sixiv-ihrcc raihymic. nude mice (25 to 30 g) bearing subcutahebus tuiriors are 
randomized iniq: nine treatment groups (n=7/treatment) and given fbur weekly bolus 
intravenous-injections of wild-type chemical conjugate HBEGF-SAP (0.5 ^g/kg and 50 

• 35 ^lag/kg). fusion protein HBEGF-SAP (0.5,:5vO,' and 50' jig/kg)/ SAP orily-(85 Mg/kg). 

.f / HBEQF only (50 (ig/kg): SAP w^ith HBEGF (85 and 50 (ig/kg.respectivelyy.W vehicle 
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20 



25 



30 



35 



(PBS w,rt 0.,./. BSA). Dos,„g n,aKrial U p,^ by nixing A. material «H,h 
appropnalc v„l™,es of PBS/0.1% BSA ,o achieve «,e desi«d doses, individual 
synnges are prepared for each animal. Mice receive four weekly IV injections (250-300 
uI),„,oU,e,a,l vei„„ndays 5. 12, 1 9 and 26 Wirt day 1 designatedias lhe dav to rte 
■umor ecus are .njeced in,o ,he mice. Doses are individualized for '^ffere^es in bodv 
weight. Tumor volume is measured twice weekly for a period of 61 days. 

B^'^'^ »*ymic mice (Roger Williams Hospital Animal 

Facthty, Providence, RI), 8-12 weeks old. are maintained in an aseptic environment 
Stxty-three ammals are selected for the study such that body weights range from ^5-30 
grarns the day prior to dosing. Animals are maintained in a quarantined room and 
handled under aseptic conditions. Food and water are supplied ad l.bUum Utroughout 

Appropriate tutnor cells are obtained from the American Type Culture 

^I'Z'f^TT' ""^ •^•"^ -PP'^ented 

wtth 1 0% fetal calf serum. Five days prior to injection of .he test material, mice teceive 
a subcutaneous injection of tumor cells (approximately 2x10* cells/mouse) in the right 

rear flank. . * ^ & 

Calipers ar^. used to mefasure the dimensions of each tumor. 
Measuremems (mm) of maximum and minimum width are performed prior to injection 
of the test material and at bi-weekly intervals for 6T days. Tumor volumes (mm^) a.^ 
computed using the formula Volurf,e=[(minimum measurementAn,aximum 
measurement)]/2. The results indicate-thm the HBEGF-containing conjugates 
substantially inhibit tumor cell proliferation m \7v«>. 

EXAMPLE 11 

PURinCATION OF FPHSS AN0FPSH2 

Purification steps 1 and 2 are performed with crude material bein- 
incubated on ice and all other steps are performed at room temperature using FPLC 
units/B.opilot (Pharmacia) equipped with P6000 pumps. Fractions/pools of the 
recombmant mitoioxins from the various chromatography steps are analyzed bv SDS- 

PAGE. ! ■ . . ;. - . . 

Step J: Preparation of cell extract. Cell pkste (900 - 1 200 g wet weight ) 
IS suspended in 3-4 volumes of ice-cold cell lysis bufifer.' (lOmM sodium citrate pH 
6.0, containing 1 M urea, 5 mM EDTA: 5 mM EGTA and 50 mM NaCl) and passaged 
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3 times through a microfluidizer (Microfluidics Corp.. Newton. MA, U.S.A.) at 18.000 
lb/in\ TTie resultant mixture was diluted-to ia.volumes^\^ 

\^<^4f<k^^dAdmmionChromt&^-dpky (EbAC). Crude cell 
" - -extriact is loade<i.onto an .e^cp^nded b^^ (300 ml r6sin in a 5 x VoO cm column) of 
5 ' Streamline SP cation - exch^^ which is previously equilibrated With Wsis buffer 

" at 70 ml/min upwards fl^w. , After Joading,, the resin is 'Wash^d witl^'&e same buffer 
until the resin appears clear. ,The plmiger is thenrsloivly (t'. 2 cm/min) lowered. When 
the plunger nears the exppded bed and the Ai,,'decreases tb ieroJthy 'iio^^^^ stopped 
and the resin is .^l.owed^,tQ^5^c,, ..Ctececthe-^unger-ls 0.5 cW^frofri'the packed bed 
10 proteins- are eluted using 2, bufifers^containing incteasing^ NaGl cdrtc^ntrations. The 
column is first washed with.bu% mM soditirti phosphate!^ pH y.b: cbntaining s 
• mM EDTA. 5 mM EGTA and 0.25 M NaCl). After A^^o reached ieti^J burt^^^ (buffer 
A with 0.8 M NaCl ) is^^plied.. This . eluate £ontairis ' the conjugate which is 
subsequently diluted 1:3 <y/v) YdtH..bu<rcr. C <biiffcr A'^cdntamini--ii^'tel) before 
15 being subjected to anion-exchange chr^ . J ; : - - ' 

, .. • -1 " • . ■ <■-■ ' ' ' ' 

Step 3: Q-Sepharose anion-exchange chromalographv and SP- 
Sepharose cat ion-exchange chrormtpgraphy,.- O^SephaWye femoves contaminating 
pNA. as welLa? endotoxin.r:;T^ from : 

. - the prfevioiis step is loaded .onto a :,cplumn;:(2.6- x T ^ift)" of Q-Sfe^^^^^ 
- connected to a column .a.6 x,. p), of SP-sepharose H previously 
• ' • equilibi^tecl in tandem, with buffer, A CGntaining 0.2 'M NacV. ' As the pi' of the 
conjugate 'is above 9.5. it does, pot bind to- the, Q-Sepharose resin; but directh flows 
through and binds to S-Sepharose resin. When the A^go reaches zero, the anion- 
exchange column is disconnected ftpm the cation-exchange column. Proteins bound to 
the S-Sepharose column are. eluted with agra.dient (TO'column volumes) of 0.25 to I M 
NaCl in buffer A. 



ii 20 



25 



Step 4: Hydrophobic Interactions. Chromatography (HlCj. Fractions 
30 ■ cdhtaining the conjugate are pooled, and solid ammonium sulfate is added (on' ice and 
^ stirring) over a period of 1 5-20 min tp a. final concentration of 2 M. the pool is passed 
through a 0.8 M filter and loaded onto a Phenyl-Sepharose HP column (2.6 x 10 cm) 
previously equilibrated with buffer D ( 1 0 mM sodium citrate, pH 6.0. containing 1 mM 
EGTA. J mM EGTA: 1 mM EDTA and 2 M (NH,),SO,). When the K,^ reaches zero 
35 bound proteins are eluted using a gradiem (10 column volumes) of 2 to OM (NH4),S04 
in the above buffer. Proteins in the various fractions are visualized by both SDS-PAGE 
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and Western blotting. There appears to be distinct pools of conjugates (3 in the case of 
FPHS5 and at least 2 in the case of FPSH2). In the case of FPHS5, two pools (A and 
B). are made. Monothioglycerol (MTG. 10 mM final concentration) is added to both 
pools. Pool A is dialyzed against formulation buffer (10 mM sodium citrate. pH 6.0, 
5 containing 0.1 mM EDTA and 0.14 M NaCl) and subjected to size-exclusion 
chromatography (Step 5). Pool B is dialyzed against buffer E (10 mM sodium 
phosphate buffer, pH 8.5, containing 5 mM EDTA and 5 mM EGTA) and subjected to 
another cation-exchange fractionation (Step 4). 

1 0 Step 4B: S-Source cation-exchange chromatography^ Pool B is applied 

to an S-Source colimm (2.6 x 5.7) previously equilibrated with buffer E and bound 
proteins are clutcd with a gradient (10 column volumes) of 0 lo I M NaCl in buffer E. 
Two pools (C and D) are made on the basis of SDS-PAGE analysis. Pool C contained 
an £. coli a major contaminant (- 25-27 KDa). which is difficult to remove. Pool D is 

15 then subjected to size-exclusion chromatography (Step 5). 

Step 5: Size exclusion chromatography. Both pools are passed 
separately through a suitably sized (i.e.. the sample load volume is 10-15% of the total 
column volume) column containing SI 00 resin previously equilibrated with formulation 
20 buffer. From 1 kg. of wet weight paste, approximately 50 mg of purified FPHS5 (Pool 
A) and 10 mg of Pool D are recovered. At .least 3 isoforms are apparent. Various 
analytical methods reveal the conjugates to be over 95% pure. 

EXAMPLE 12 

25 Purification of FPHS2 and FPHSH 1 

These fusion proteins were essentially purified in the same manner as in 
Hxamplc 1 . except that a heparin sepharose FF afRnity column was used prior to step 5. 

Since modifications will be apparent to those of skill in this art. it is 
30 intended that this invention be limited only by the scope of the upended claims. 
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SEQUENCE LISTING 

(1) GENERAL. INFORMATION: 

, .(i). APPLICANT:^ v'O^arbara.Sosnowski 1.4-=^;. . • .* / 

Lois. Chandler 

*• • ' ** " L. ' L. ' Houston 

' "* i..'. '■. Michael. ^Nova ' / / \ y:^-'-^) • p' ' . • 

, ... . John R. McDonald - . 

(ii) TITLE OF INVENTION: CONJUGATES . OF HEPARIN- BltolNG EPIDERMAL 
GROWTH FACTOR -LIKE GROWTH FACTOR AND TARGETED AGENTS 

,-(iii) .NUMBER OF SEQUENCES:- . 87 i- * 

' (iv> COlUlESPONdENCE 'address : ' ' ^ ^ 

• r ... • (A) ADDRESSEE :v Seed. & Berry ' ). ■ 
(B) STREET: (5300 Columbia Center 
- " ■ • ' (C) CITY: Seattle^ ^- ' ' '* ' ' '''''' " " 
: , . (D), STATE. : Washington/! ' ' ' ' • 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092* '* 

(v) COMPUTER READABLE FORM: 
' (A) MEDIUM TYPE: "'Floppy disk " ' 

-o! 'l o : - COMPUTER : ' IBM PC compatible' ' ^ ^ - 

.(C) OPERATING SYSTEM.: , PC 7POS /MS -DpS. , , . f- ..', w-;rt..,-u-\ r ' 
• . ^ rr: ; ; - SOFTWARE :' Piit'entlii 'Release 0,' Version #1 .25 ' 

. , (vi) CURRENT APPLICATION DATA.: . ,^ , ; - : , . 

(A) APPLICATION NTJMBErV ' ' " ' ' * * ' " ' 

(B) FILING.,DATE::' . * '-r^i • • L ' ' 

(C) CLASSIFICATION: 




(vi) PRIOR APPLipVTI.ON DATA:: ,^r. , . . - 

(A) APPLICATiON NUMBER:' 08/297,961 

(B) FILING DATE: 29-AUG-1994 

(C) CLASSIFICATION: 

,: Jvi) , PRIOR APPLICATION DATA: " . 

(A) APPLICATION NUMBER: 08/213,44 6 
^* * (B) - FILING -DATE':* '^l5*-MAR-1994 ' 
:-.{G),: CLASSIFICATION : . / "r ^ .1 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/213,447 

(B) FILING DATE: 15-MAR-1994 

(C) CLASSIFICATION: 



(2) INFORMATION FOR SEQ ID NO 1: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 627 base pairs 
(B> TYPE: nucleic acid 
{ C ) S TRANDEDNES S : doubl e 
(D) TOPOLOGY: both 

(ii) MOLECIXLE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..627 

(D) OTHER INFORMATION: /note "human HBEGF precursor" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG AAG CTG CTG COG TCG GTG GTG CTG AAG CTC TTT CTG GCT GCA GTT 4 8 

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val 

1 5 10 ^ - 15 

CTC TCG GCA CTG GTG ACT GGC GAG AGC CTG GAG CGG CTT CGG AGA GGG 96 
Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly 
20 25 30 



CTA GCT GCT GGA ACC AGC AAC CCG GAC CCT CCC ACT GTA TCC ACG GAC 
Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Pro Thr Val Ser Thr Asp 
35 40 45 

CAG CTG CTA CCC CTA GGA GGC GGC CGG GAC CGG AAA GTC CGT GAC TTG 
Gin Leu Leu Pro Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu 
50 55 60 



144 



192 



CAA GAG GCA GAT CTG GAC CTT TTG AGA GTC ACT TTA TCC TCC AAG CCA 240 
Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser Ser Lys Pro 
^5 70 75 80 

CAA GCA CTG GCC ACA CCA AAC AAG GAG GAG CAC GGG AAA AGA AAG AAG 288 
Gin Ala Leu Ala Thr Pro Asn Lys Glu Glu His Gly Lys Arg Lys Lys 
85 90 95 

AAA GGC AAG GGG CTA GGG AAG AAG AGG GAC CCA TGT CTT CGG AAA TAC 336 
Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr 
100 105 110 

AAG GAC TTC TGC ATC CAT GGA GAA TGC AAA TAT GTG AAG GAG CTC CGG 384 
Lys Asp Phe Cys He His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 
115 120 125 ^ 

GCT CCC TCC TGC ATC TGC CAC CCG GGT TAC CAT GGA GAG AGG TGT CAT 432 
Ala Pro Ser Cys He Cys His Pro Gly Tyr His Gly Glu Arg Cys His 
130 135 140 

GGG CTG AGC CTC CCA GTG GAA AAT CGC TTA TAT ACC TAT GAC CAC ACA 480 
Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr 

150 155 160 

ACC ATC CTG GCC GTG GTG GCT GTG GTG CTG TCA TCT GTC TGT CTG CTG 528 
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Thr He Leu Ala Val Val Ala Val Val Leu SerSer Val Cys Leu Leu 



vlf Til ^ '"'■^ ""^ «T AGG' AGA GGA GGT TAT 

val He val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly ^ 

GAT GTG GAA AAT GAA GAG AAA GTG AAG TTG GGC ATG ACT AAT TCC CAC 

ASP val Glu Asn Glu Glu Lys Val l,ys Leu Gly Met^ Thr sir 

200 205 / 

TGA •^' ' 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTER I STXCS : . ' "* • ' 

(A) / LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(D).. TOPOLOGY: linear; . ^ , - ^ * 



(ii) MOLECULE TYPE: protein 

(ix) FEATURE:."' ' " ' - . . ^ - - 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..627 

. (D) OTHER .INFOR>lATION: /note-*' human i^^^ 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Lys Leu Leu Pro S^x Val Val :Leu Lys Xeu 'Phe lieti ^AXk Ai^ ' 
. 5 • 10 



Ala Val 

15 



Leu^Ser Ala L^u Val Thr Gly .Glu S.er Leu Glu Arg Leii A^g Xr^ Gly 

. - .,*:. * . . \ : t: . '.25 * : i : s . ' 30 

Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Pro Thr Val Ser Thr Asp 



4 5 



Gin Leu Leu Pro Leu Gly Gly Gly Arg Asp Arg Lys vkl Arg Asp L^u 

Gin Glu Ala As?; Leu Asp Leu, Leu Arg Val Thr ' Leu- Ser Ser Lys Pro 

''^ ^ 75 80 

Gin Ala Leu Al a, Thr Pro ; Asn Lys Glu Glu His- Gly Lys Arg Lys Lys' 

?? : ' <. . -90 '■• 95 • 

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr 

. ' - . a05; . 7 - . -110' \' ^ 

Lys Asp Phe cys He His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 

lis 150 ^ 



576 



624 



627 



125 
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Ala Pro Ser Cys lie Cys His Pro Gly Tyr His Gly Glu Arg Cys His 
130 135 140 

Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr 
145 li50 155 160 

Thr lie Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu 
165 170 175 

Val lie Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr Asn Ser His 
195 200 205 



(2) INFORMATION FOR SEQ ID Np:3:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

<D) OTHER INFORMATION: /note "human mature HBEGF" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Arg Val Thr Leu Ser Ser Lys Pro Gin Ala Leu Ala Thr Pro Asn Lys 
15 10 15 

Glu Glii His Gly Lys Arg Lys Lys Lys Gly Lys Gly Leu Gly Lys Lys 
20 25 30 

Arg Asp Pro Cys Leu Arg Lys Tyr Lys Asp Phe Cys lie His Gly Glu 
35 40 45 

Cys Lys Tyr Val Lys Glu Leu Ar^ Ala Pro Ser Cys lie Cys His Pro 
50 55 60 

Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser Leu Pro 
65 70 75 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: un3cnown 

(D) TOPOLOGY : unJcnown 
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<ii) MOLECULE TYPE: protein 
(ix) FEATURE; 

i..(D) OTHER INFORMATiOiJ : /note "monkey HfiEGF precursor" 

f: \\ : ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Leu Leu Ala Ala Val 

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Gin Leu Arg Arg Gly 
20 25 _ 30 

Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Ser Thr Gly Ser Thr Asp 
35 40 45 

Gin Leu Leu Arg Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu 

50 55 _ 60 . .. . . . ... 

Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser!. Ser Lys Pro 
65 70 75 . , 80 



Gin Ala Leu Ala Thr Pro Ser Lys Glu Glu His Gly' Lys Arg Lys Lys 

85 90 ^ :r-^5.. 

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys -Tyr , 
100 105 , ... . . iio, ' 



_ i. f'u- 



Lys Asp Phe Cys lie His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 

.... C:-: 

Ala Pro Ser Cys lie Cys His Pro Gly Tyr His ply Glu Arg ,Cys His 

^ . '13 0'- ^" -13 5 * " ' ' ''140 ' * " " 

Gly Leu Ser Leu Pro Val Glu. Asn Arg Leu Tyr .Thr. Tyr. Asp His Thr- 
145 - • . • ^ ' '=150' ' ^ • ' ' i55" 160 

Thr lie Leu Ala Val Val Ala Val Val Leu Ser Ser Val pys Leu Leu : 
• ' -'' - *165- - " - ' ' ' 170 ' ' ' 175 

Val lie Val Gly Leu Leu Met Phe Arg Tyr His. Arg Arg Gly^.Gly Tyr.. . 
- • ' i 180 ^ - ' ' 185 . 190 . 

Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr As.n .Ser His, 
195 ' 200 * /: '205 

(2) INFORMATION FOR SEQ ID NO: 5: ^ . . . 

(i) SEQUENCE CHARACTERISTICS: , , 

<A) LENGTH: 208 amino acids \ 'i. 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown [ . . j 

(D) TOPOLOGY: unknown - 

(ii) MOLECULE TYPE: protein 
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(ix) FEATURE: 

CD) OTHER INFORMATION: /note «rat HBEGF precursor- 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val 
1 5 - 10 15 

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly 
20 25 30 

Leu Ala Ala Ala Thr Ser Asn Pro Asp Pro Pro Thr Gly Thr Thr Asn 
35 40 45 

Gin Leu Leu Pro Thr Gly Ala Asp Arg Ala Gin Glu Val Gin Asp Leu 
50 55 . 60 

Glu Gly Thr Asp Leu Asp Leu Phe Lys Val Ala Phe Ser Ser Lys Pro 
^5 70 75 80 

Gin Ala Leu Ala Thr Pro Gly Lys Glu Lys Asn Gly Lys Lys Lys Arg 
85 90 95 

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Lys Lys Tyr 
100 105 110 

Lys Asp Tyr Cys He His Gly Glu Cys Arg Tyr Leu Lys Glu Leu Arg 
.115 , . ^ 120 125 

He Pro Ser Cys His Cys Leu Pro Gly Tyr His Gly Gin Arg Cys His 
130 135 140 

Gly Leu Thr Leu Pro Val Glu Asn Pro Leu Tyr Thr Tyr Asp His Thr 

150 155 160 

Thr Val Leu Ala Val Val Ala Val Val Leu Ser S6r Vai Cys Leu Leu 
165 170 175 

Val He Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

Asp Leu Glu Ser Glu Glu Lys Val Lys Leu Gly Met Ala Ser Ser His 
195 200 205 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
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(ix) FRATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: .1. .1002. - ,. 

(ix) FEATURE: c- . -. • 

(A) NAME/KEY: CDS 

(B) LOCATION: ,1. .627. , , v :: - . . ' 

' ^D)&inSR iNFORMATION: /note "HBEGF Val Met Saporin" 

(xl) 'SEOUEliiCE DESCRIPTION: SEQ IDNO:6: 

ATG AGA GTC ACT TTA TCC . TCC AAG CCA . .CAA .GCA- CTG GCC ACA CCA AAC 
Met Arg Val Thr Leu ^Ser Ser Lys Pro Gin- Ala Leu Pro ^1 

10 15 

^- t^^- l^^^'''' 2f aW' aAG AAG A^uv'sisc AAG GGg' CTA GGG AAG ' 

^y. G.. G.a Hxs Gly Lys Arg Lys Lys Lys Gly Lys Gly Leu Gly Lys 

... .-25 .i -J r-.-. .•■■30:-- 

AAG AGG GAC CCA TGT CTT CGG XaA TAC AAG GAC TTC TGC ATC CAT GGA 
I.ys Arg ASP Pro Cys Leu. Arg Lys Tyr. Lys- Asp ,.Phe C^s SI Sy ■ 

GAA TGC AAA TAT GTG AAG . GAG , pTC CGG GCT CCC TCC TGC ATC TGC CAC 
Glu cys Lys -Tyr val Lys Glu Leu Arg Ala Pro Ser Cys Xle '^s Ss 



60 



CCG GGT -TAC CAT GGA SAG AGG TGT CAT GGG CTG AGC CTC CCA GTC ATb 
Pro Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser Leu Pro vll mH 

...O - . ; ^° V/.:' : ■ ■■■■ ■■• >•• ^ 75 ^-^-^ : ^ ^.C; .^^ 

III Jhr IT. rT ^T '^'^ ^''^ «<=^ ^ TAC 

Val Thr Ser He Thr ^^u Asp Leu Val Asn Pro Thr Ala Gly Gin 

ser ser Phe Val Asp Lys He Arg Asn Asn Val Lys Asp Pro Asn Leu 
100 105 



110 



AAA TAC GGT GGT AcC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA 
I^ys Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser ^ 

... . 12,0 . • 125 ■ 

AAA TTC CTT AGA ATT AA^" TTC CAA AGT TCC CGa' GGA ACG GTC TCA CTT^ 
Lys Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu 
130 135 



140 



GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG:TAT CTT GCA ATG GAT " 
Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met-'Asp ' 

"0 155 . V ; - :;i66 ' 

Jk"" ""T '^'^ """^ """^ ^ GAA ATT ACT TCC 

Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He Thr Ser^ 

170 3^75 



48 



96 



144 



192 



240 



. 268 



336 



364 



432 



480 



528 
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GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA 576 
Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys 
180 185 190 

GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG 624 
Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin 
195 200 205 

ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC 672 
He Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly He Asp 
210 215 220 

TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT 720 
Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val 
225 230 235 240 

AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG 768 
Lys Asn Glu Ala Arg Phe Leu Leu lie Ala He Gin Met Thr Ala Glu 
245 250 255 

GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC 816 
Val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn Phe Pro 
260 265 270 

AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG 864 
Asn Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe Glu Val Ser Trp 
275 280 285 

CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT 912 
Arg Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe 
290 295 300 

AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC 960 
Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp 
305 310 315 320 

TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG TAG 1002 
Leu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
325 330 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

He Arg Val Arg Arg 
15 
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(2) INFORMATION FOR SEQ ID NO:8.: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 
(B,)r-.TYP?:: ■ nuclexcsacid . 

( C) -^STRANDEDNES : , double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE ..TYPE : ;CDNA: V 

(ix) FEATURE : :] ^ 

(A) NAME/KEY: CDS 

(B) LpGATION :/!>... 804 - 



(ix) FEATURE: ^. 

(A) NAME/KEY: misc_feature 

(B) , •LOGM'IOU,; I'; . 004' ': • . W-r- :.' ' ' T ' >• • • - 

(D) OTHER INFORMATION :. 7note= "Nucleotide sequence ' ' 

corresponding to the clone M13 mpl8-G4 in Exan^le I.B.2." 

^ (ix) FEATURE;:-^ , :,y . ::r'* ' "i - ; - . 7 . ' ' 

(A) . NAME/KEY: ma.t_peptide 'i- . ■ ^ ' • ' 

(B) LOCATIQNc 4 6.. 8 04 

(D) OTHER INFORMATION: /product* ""Saporin"" 

(xi) SEQUENCE.DESGFIIPTION.; :SEQ:ID^N0:8: ■ ^J- ' 

GCA TGG ATC CTG CTT CAA TTT TCA GOT TGG ACA ACA ACT GAT GCG GTC 48 
^^^.r.?^ Gin Phe Ser Ala Trp Thr Thf tUr Asp Ala Val ' 

•15 J • * -10 ; % : • ..c; r- : ' - . .:*.' i^^: " ^ 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96 

Thr Ser lie Thr Leu ^sp Leu Val Asn Pro Thr^ Ala Gly Gin Tyr Ser ~ ' ' 

c;:^, ^ i^' : > 10 /'-■^ -is ' ' •"•'^ 

TCT TTT GTG GAT AAA ATC CGA AAC AAT GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys lie Arg. Asn Asn Val Lys Asp Pro Asn Leu= Lys 

20 -^5 ■•: . ; V "30 ■ " 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT -TCT AAA GAA AAA 192 
Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys 
35 40 45 

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 24 0 

Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly ' 

50 55 60- . : - - ' . 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He Thr Ser Ala • ' 
85 90 95 
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GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala 
100 105 110 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432 
Leu Glu Tyx Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala Gin lie 
115 120 125 

ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480 
Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly lie Asp Leu 
130 135 140 145 

CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528 
Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA bVb 
Asn Glu Ala Arg Phe Leu Leu lie Ala lie Gin Met Thr Ala Glu Val 
165 170 175 

GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624 
Ala Arg Phe Arg Tyr lie Gin Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672 
Lys Phe Asp Ser Asp Asn Lys Val lie Gin Phe Glu Val Ser Trp Arg 
195 200 205 

AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys lie Ser Thr Ala lie Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804 
Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : doiible 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAJyiE/KEY: CDS 

(B) hOCATlO^i 1. .804 
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(ix) FEATURE: 

(A) ' NAME/KEY: misc_f eature 

(B) LOCATION: 1., 8 04 



(D) OTHER INFORMATION: /note= "Nucleotide sequence . , 
•' ' 'corresponding to the clone Ml 3 mp 18^- Gl in Example I.E. 2. 

(ix) FEATURE: ^' 

(A) NAME/KEY: mat_peptide 

(B) ^ LOCATION: '46. .iB04'*^ * ' ' ' ; ' !' 
(D) OTHER INFORj^TION: * /products "Saporin'! , * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : ? : _ , . . 

GCA TGG ATC CTG CTT CAA TTT' TCA GCT 'TGG' ACA * ACA ACT GAT. GCG GTc' 46 
Ala Trp lie Leu Leu Gin Phe Ser Ala^ Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 , . . 1 , 

ACA TCA ATC.ACATTA'GAT GTA -GTA' AAT • CCG ^ACC GCG GGT CAA TAG ".TCA 96 
Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser 

5 10 _ . ' • . ... ... . 

TCT TTT GTG GAT : AAA ATC ' GGA " AAC ^AAC ' GTA ' AAG ' GAT CCA MlC CTG 'AAA . 144 
Ser Phe Val Asp Lys lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 

20 25 30 , ... - ^ ^ . . , 

TAC GGT GGT .-ACC GAC ATA GGC 'iGTG ATA GGt CCA CCT ^TCT AAA GAA AAA ' J . 192 
Tyr Gly Gly Thr Asp lie Ala -Val lie Gly Pro Pro Ser Lys Glu Lys 

35 ^ ....... , -,^f-.- rv-. rJ-'J- O/- 

TTC CTT AGA ATT AAT TTC CAA AGT tCC CGA GGA ACG GTC TCA CTT GGC 24 0 

Phe Leu Arg lie Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly . . 

SO- - '"SS '-'^ ■ ''' ^f '''60 7^^' ""' " . ^*^5 ...... 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala I^et Asp Asn 

i ■ 70 ^ 75 ' *V ' BO \ I . ' 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu lie Thr Ser Ala 
85 90 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG.AAA .GCT . . 384 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala " 
100 105 ' ' 110 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAGIaTA" 432 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala Gin lie 
115 120 125 

ACA CAG GGA GAT AAA TCA AGA AAA GAA CTC GGG TTG GGG ATC GAC. TTA , . 480 
Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly lie Asp^Leu . * 
130 135 140 - '^14^1^ 
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CTT TTG ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528 
Leu Leu Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

AAC GAA GCT AGG TTT CTG CTT ATC GCT.ATT CAA ATG ACA GCT GAG GTA 576 
Asn Glu Ala Arg Phe Leu Leu lie Ala lie Gin Met Thr Ala Glu Val 
165 170 175 

GCA CGA TTT CGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624 
Ala Arg Phe Arg Tyr lie Gin Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTP GAA GTC AGC TGG CGT 672 
Lys Phe Asp Ser Asp Asn Lys Val lie Gin Phe Glu Val Ser Trp Arg 
195 200 205 

AAG ATT TCT ACG GCA ATA TAC GGA GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 
230 235 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804 
Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

( ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..804 . 

(D) OTHER INFORMATION: /note= "Nucleotide sequence 

corresponding to the clone M13 inpl8-G2 in Example I.B.2." 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46 . . 604 

(D) OTHER INFORMATICTI : /product. - Sapor in" 
(xi) SEQUENCE , DESCRIPTION: SEQ ID NO:10: 
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GCA TGG ATC CTG CTT CAA TTT: TCA GCT TGG ' ACA ACA ACT GAT GCG GTC / , 4.8 

Ala Trp lie Leu Leu Gin Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 

-15 -10 . -5 1 

ACA TCA ATC ACA; TTAvGAT:;CTA: GTA^^^ CAA TAC TCA .'I 96 

Thr Ser lie; ytir.,i:^U;,Asp Leu^ Vdl ■ A^ 

5 : V. 10 15 

TCT TTT GTG GAT - |JUl :ATC :CGA /AAC--" AA - GAT 'cCA AAC CTTG" AAA ' , " 144 

Ser Phe Val Jf^p - Lys , 11^ Lys-TAsp Pro ' Asn Leu Lys ' 

20 25 30 

TAC GGT GGT ACC^ GAC ATA GCC/ GTG ATA GG^C CCA ' CCT TCT AAA GAT AAA \ " .192 
Tyr Gly Gly .Thr ,Asp -lie Ala .Val cll^ Gly Pro Pro Ser Lys Asp Lys ' 
35 40 ' 45 - 

TTC CTT AGA ATT AAT lTTC JCAA AGT /TGC "GSA GGA ACG GTC TCA tTT GGC: \ " 240 
Phe Leu Arg lie Asn Phe Gin iSer Ser Arg Gly Thr Val Sear Leu Gly 
50 55 60 ' ' 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC QCG TAT CTT GCA a¥g GAT AAC \ * 288. 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
•70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC "\ 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe L^^ Ser Glu 116 Thr S^r Ala ' 
85 90 ' 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT' QCA*' AXt CAG &kk GCt " " " ' 384 ' 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala^ 

100 105 - /-oiff^.-'J^'^ ■^•^•-•--^ 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG 'AtA 432 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala Gin tie 
115 120 125 - ' - 

ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG* ATC GAC TTA' ' '4 80 

Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly lie Asp Leu 
130 135 140 .145 / 

CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG * GTT AAA 528 
Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
150 155 160 

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA ' 576 
Asn Glu Ala Arg Phe, Leu : Leu lie Ala lie Gin Met Thr Ala Glii Val ' 
- 165 . . - 170^ ' - 175 

GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC * 624 
Ala Arg Phe Arg Tyr lie Gin Asn Leu Val Thr Lys Asn Phe Pro Asn 
180 185 190 

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672 
Lys Phe Asp Ser Asp Asn Lys Val lie Gin Phe Glu Val Ser '"^rp Arg ' 
195 200 205 
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AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720 
Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 
210 215 220 22S 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 

230 235 ■ . 240 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804 
Gin Met Gly I^u Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs . • - . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

{ix) FEATURE: * \ 

(A) <' NAME/KEY: misc_feature 

(B) LOCATION: 1..804 

(D) OTHER INFORMATION: /note= "Nucleotide sequence 

corresponding to the clone M13 mpl8-G7 in Exan^le I.E. 2." 

(ix) FEATURE : 

(A) NAME/KEY: matj>eptide 

(B) LOCATION: 46 . .804 

(D) OTHER INFORMATION: /product* "Saporin" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 46 
Ala Trp lie Leu Leu Gin Phe Ser Ala Trp Thr Thr Thr Asp Ala Val 
-15 -10 -5 1 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96 
Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser 
5 10 15 . 

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 ^0 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192 
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Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys 

35 ' / . . . 40 , 45 - . • . 

TTC CTT AGA.ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC ^TCA CTT GGC 240 
Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val -Ser Leu Gly - 

50 '/ ^-..55^- - ■ 'j-'eo::. ' 'V • ■ ; V-'.es' 

CTA AAA CGC GAT-AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met , Asp -Asn 

70 >-^'^ '75/; ' ■ so.; ' r 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu He Thr Ser- Ala =. . : 

85 90 ' 95 

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA. AAT CAg" AAA GOT 384 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr thr Ala Asn Gin Lys . Ala 

105 '* ' 110. . ./ 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA jS^G AAt' GCC CAG ATA 432 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin -He 

115 120 125 ■ - - ' 

ACA CAG GGA GAT AAA TCA AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA . 480 
Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu. Gly He Asp Leu 

135 140 .Z" ] ; .145. 

CTT TTG ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT- jyjLA . .528 
Leu Leu Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 

150 155^-' :"^:l6a : 

AAC GAA GCT AGA TTC 'GTT ' CTT -ATC* GCT " A 576 
Asn Glu Ala Arg Phe Leu Leu lie ' ASia "li^ Gl^^ Met Thr* Ala Glu Ala 
165 170 175 ; 

GCA CGA TTT AGG TAC ATA CAA AAC TTG GTA ATC AAG" AAC TTT, 6CQ AAC 624 
Ala Arg Phe Arg Tyr He Gin Asn Leu Val He Lys . Asn Phe Pro Asn » 

180 ' - 185 • i90 ' 

AAG TTC AAC TCG GAA AAC AAA GTG ATT CAG TTT GAG GTT AAC TGG AAA ^ 672 
Lys Phe Asn Ser Glu Asn Lys Val lie Gin Phe Glu Val Asn Trp Lys 

195 200 205 . . , . . . . 

AAA ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC Gt6 TTT AAT 72 p» 

Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 

210 215 220 . 225 .. . , 

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AG<3 CAG dTG AAG GAC TTG ' 768 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu 

230 235 _ 240 , .. . - - . 

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA tCA AAG " . ' 804 

Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 
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(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: iinknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..804 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1^804 

(D) OTHER INFORMATION: /note = "Nucleotide sequence 

corresponding to the clone M13 mpl8-G9 in Example I.B.2. 

(ix) FEATURE: 

(A) NT^ME/KEY: mat_peptide 

(B) LOCATION: 46.. 804 

(D) OTHER INFORMATION: /products « Sapor in" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48 
Ala Trp lie Leu Leu Gin Phe S^r Ala T?:p Thr Thr Thr Asp Ala Val 

llO v / . ' - . -5 1 ' 

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96 
Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser 

'5 ' ^. .10 : . . 15 

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144 
Ser Phe Val Asp Lys lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys 
20 25 30 

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192 
Tyr Gly Gly Thr Asp lie Ala Val lie Gly Pro Pto Ser Lys Glu Lys 
35 40 45 

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240 
Phe Leu Arg lie Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly 

50 55 60 65 

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288 
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn 
70 75 80 

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336 
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu lie Thr Ser Ala 

85 90 - 95 
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GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala 

100 lOS -■- ■ • 110 - - 

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATT GAA 'AAG- AAT' GCC fcAG ' AT^A 
Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Giu-iiys A^'h' a14 filn- lie 
115 120 :~1.2^.. i ; - 

ACA CAA GGA GAT CAA AGT AGA AAA GAA CTC GGG TTG'gGG ATT Ga6 T^-a' 
Thr Gin Gly Asp Gin Ser Arg Lys Glu Leu Gly Leu Gly He Asp Leu 

135 140 •:i4'5' 

CTT TCA ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 
Leu ser Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys 
ISO 155 160;. . 

GAC GAA GCT AGA TTC CTT CTT ATC GCT ATT CAG ATG ACG GCT cSag' GCA 
Asp Glu Ala Arg Phe Leu Leu,.:lle :Ala He Gin Me f 'Thr kia Glu' Aia ' 

• , . . 1,S5- . . . r - ,. ,170- v;.;- . -J -,5. c- 

GCG CGA TTT AGG TAC ATA CAA AAC TTG GTA ATC AAG AAC TTT CCC AAC 
Ala Arg Phe Arg Tyr He Gin Asn Leu Val lie Lys Ash Ph^ Pro Asn 

180 185 ■ - 19b ; • / 

AAG TTC AAC TCG GAA AAC AAA GTG ATT CAG TTT GAG GTT AAC TGG AAA 
Lys Phe Asn Ser Glu Asn Lys Val IleCQln^Phe Glu Val A^n Trp Lys 
195 200 205 



AAA ATT TCT ACG GG^ ATA TAC GGG GAT QCG AAA AAC 6gC GTG TTT 
Lys lie Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn 

215 220 225 

AAA GAT TAT GAT TTC GGG TTT GGA, AAA GTG" AGG CAG GTG AAG GAC" ^T^ 
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Airg Gin Val Lys Asp t^eu 

235 _ 240 

CAA ATG GGA CTC^ CTT: ATG, TAT. TTG, GGC AAA CCA" AAG~ ' - ' ' ' 

Gin Met Gly Leii Leu Met Tyr Leu Gly Lys Pro Lys 
245 250 

(2) ' INFORMATION ; FOR SEQ „ID..NO: 13 . ■ ' ' - ' - .... 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: .^40 amino :acids - 

(B) ,,TYPE: amino acid . • . - . 

(C) -STRANDEDNESS: single- 

(D) TOPOLOGY: xinknown 

(i'i) MOLECULE . TYPE : peptide. -''^ 
(ix) FEATURE: 

(D) OTHER INFORMATION: /product= "SO-4^ ' ' ' ' 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



384 



432 



480 



528 



576 



624 



672 



720 



768 



804 



WSDOCID: <WO_96(»274A2_l_> 



wo 96/Q8274 




PCT/US95/12205 



118 



Val He He Tyr Glu Leu Asn Leu Gin Gly Thr Thr Lys Ala Gin Tyr 

5 10 15 

Ser Thr He Leu Lys Gin Leu Arg Asp Asp He Lys Asp Pro Asn Leu 
20 25 - 30 

Xaa Tyr Gly Xaa Xaa Asp Tyr Ser 

35 40 . 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

He Lys Arg Gin Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CATATGTGTG TCACATCAAT CACATTAGAT 30 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CAGGTTTGGA TCCTTTACGT T "21 
(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 22 base pairs 
< .... „(B) TYPE: nucleic acid : ' ' . - - ' ^ 

(C) . STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)'^ 

(ix) FEATURE: • ^ ' < " . : . ' . 

(A) NAME/KEY: misc^recomb 

(B ) LOCATION : 1 0 . . 1 5 V : . ' * : - - - ' ^ - ^ ^ ■ - ' ' - 

(D) OTHER INFORMATION: /staridafd_name=' "Neb I'' restriction enzvme 
recognition site" ^ - 

(ix) FEATURE: i - • ' 

(A) NAME/KEY: mat_peptide 

\ *-» i -UV-/ W.rx J. X OxM . J. ^ . . 2 2 -• • i ■ J 

(D) OTHER INFORMATION: /product^ "N-terminus of . Sapor in 
protein" -''V .r;:.:.^'- • *. 

(Xi) SEQUENCE DESCRIPTION: SEQ IDNO:17: • ' ' ' ' ' 

CAACAACTGC CATGGTCACA TC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: i::"- ^ r.:,^ O: :i.:"r:.^,- 

(A) LENGTH: 59 base pairs --^-'^ ox^;. m/: ^^^a 

{B) TYPE: nucleic acid ^l:::r^^:. : ? ■i.-mcaini^.-il : • 

(C) STRANDEDNESS: single rv^iii: : \ I i^:) 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: . - '.'^ " ' '. . U -:. .. . / .-K Z - 

(D) OTHER INFORMATION: /product = trp promoter 

AATTCCCCTG TTGACAATTA ATCATCGAAC TAGTTAACTA GTACGCAGCTT TGGCypCAG \ . - 59- 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 59 base pairs - - 

(B) TYPE: nucleic acid * 

(C) STRANDEDNESS: single :.. /.■ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE:. ' ^ . - 

(D) OTHER INFORMATION/product= bacteriophage lambda CII ribosome 
binding site \* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:' ' ' . ; • 
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GTCGACCAAG CTTGGGCATA CATTCAATCA ATTGTTATCT AAGGAAATAC TTACATATG 59 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs • ' ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: IX . . 16 

(D) OTHER INFORMATION: /standard_name»= "Nco I restriction enzyme 
recognition site." 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /product* "Carboxy terminus of 
mature FGF protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GCT AAG AGC GCC ATG GAGA 19- 
Ala Lys Ser Ala Met 

1 . 5 : _ : - 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: unknown 

(ii) MOI*ECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
CB) LOCATION: 1 . . 12 

(D) OTHER INFORMATION: /products "Carboxy terminus of 
wild type FGF" 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 13.. 18 

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme 
recognition site" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
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GCT AAG AGC TGACCATGGA OA 21 
Ala I*ys Ser. • . r . 
1 

(2) INFORMATION FOR SEQ ID NO; 22: 

(i) SEQUENCE CHARACTERISTICS : : • I ^ 5 v * i - T - 

(A) LENGTH: 102 base pairs * . 

(B) TYPE: nucleic acid ' '^"'■••^ - 

(C) STRANDEDNESS : double - h 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS ' * . * 

... .. .(B) LOCATION:: 1. .96 r . : ' . V: 

(D) OTHER INFORMATION: /product^ "pFGFI^COI*' : • v 

/note= "Equals the plasmid pFCSO with native FGF 
stop codon removed. " ' • • • ' 

( ix) FEATURE : ^ ^ j: • ' : ' 

(A) , NAME/ ICE Y: misc_recomb - ' : \ ' 

(B) LOCATION: 29.. 34 . - ' - - v;.... 

(D) OTHER INFORMATION: /standard_name= ''Nco I restriction enzyme 
recognition site" ^, : ? \ - . 1 'mi^- ' v.: - ^z,.. ; 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 22: .-.r.-^.^ iO . r- . . /• - 

CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GAG ATC CGG CTG AAT 48 
Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Glu He Arg Leu Asn 

1 5 10 . ■c^' o.: or:^- <r^s 

GGT GCA GTT CTG TAC CGG TTT TCC TGT GCC.GTC'>^ 

102 • , V :> r - . j . ' , 

Gly Ala Val Leu Tyr Arg Phe Ser Cys Ala ValcPhe Gin- Asp SBt'''"'- ^' 
20 25 ; , r: 2q .: " ; 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs '* 

(B) TYPE: nucleic acid 'i- 
iC) STRANDEDNESS: single 

(D) . -TOPOLOGY: linear v. r . . * ' 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:' . . * 

AAGGAGATATACC ATG .GGC AGC AGC CAT. CAT- CAT. CAT CAT CAC AGC AGC ' ^ 4 3 

Met Gly Ser Ser His His His His His His Ser Ser- - 
15 10 

GGC CTG GTG CC^3 CGC GGC AGC CAT ATG CTC GAG GAT CCG 88 
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Gly Leu Val Pro Arg Gly Ser His Met Leu Glu Asp Pro 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GGATCCGCCT CGTTTGACTA CTT 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDI^SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CTGGACCATA TGAGAGTCAC TTTA 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTATATCATG ACTGGGAGGC TCAGCCCATG ACA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 



20 



25 
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(iv) ANTI -SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 6.. 11 .•.^l?:'" 

(D) OTHER INFORMATION: /staridard:rtaine= /Eco^^^^ Site" 

(ix) FEATURE: " "* ' '* ' 

(A) NAME/KEY: sig_peptide ... • - 

(B) LOCATION: 12.. 3 0 

, ^ OTHER INFORI^TION: '^/f unctions "N-teonina extension 

/products "Native Mporin signal peptide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CTGCAGAATT CGCATGGATC CTGCTTCAAT 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs ^ ^ ^ ^' \ ' 

(B) TYPE: nucleic acid • ' : ^ ' • 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) , 
(iv) ANTI- SENSE: YES 



30 



(ix) FEATURE: 

(A) NAME/KEY: misc^recomb - CI ' * - - 

(B) LOCATION: 6 . . 11 

(D) OTHER INFORMATION: /standard_.name= \"^co^ Site" 

(ix) FEATURE: ' ^ *^ . v • . ' 

(A) NAME/KEY: terminator 

(B) LOCATION: 23. .25 • . . • . 
CD) OTHER INFORMATION: /note= "Ant i -sense stop codonr 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 26.. 30 - " 

(D) OTHER INFORMATION: /note= "Anti-sense to carboxyl 
terminus of mature peptide"'' * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: * 

CTGCAGAATT CGCCTCGTTT GACTACTTTG ' ' * ' ' .. ^- • 

(2) INFORMATION FOR SEQ ID NO: 29: " - ' * 

(i) SEQUENCE CHARACTERISTICS: " 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid . .v .- ' - • - •* - ' - 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECUI.B TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



AGCCCGGAGC TCCTTCACAT ATTTGCATTC TCCGTGGATG CAGAAG 



46 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GTGAAGGAGC TCCGJGGCTCC CTCCTGCATC TGCCrACCCCSG GTTATC:ATGG AGAGAGG 57 

(2) INFORMATION .FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(.B) TYPE: nucleic acid 

(C) STRANDEDNESS :. single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) ^ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
ATATAGAATT CTGTCTTCTC AGAGGTA 27 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESS: single . - - 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ATATACCATG .GCTGGGAGGC TCAGCCCATG ACA 



33 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

^: (D) TOPOLOGY : -both - ' ' 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY : CDS . " " ' ' / ' " 

(B) LOCATION: 1..1002 ' 
(D) OTHER INFORMATION: /product^ "Lixiker Amenable 

HBEGF-SAP" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

ATG AGA GTC ACT TTA TCC TCC AAG CCA CAA GCA CTG GCC ACA CCA AAC ' 48 

Met Arg Val Thr Leu Ser Ser Lys Pro Gin Ala Leu Ala Thr Pro Asn 

X '\. s: / . • . ■ ■ ^" 10 ' - ■ . ^5 • 

AAG GAG GAG CAC GGG AAA AGA AAG AAG AAA GGC AAG GGG' CTA ^GGG" AAG ' • ■• ' ' 96 

Lys Glu Glu His Gly Lys Arg Lys Lys Lys Gly Lys Gly Leu Gly Lys 

20 25 '''^'^'^''^''.'l^y'''i9^ 

AAG AGG GAC CCA TGT CTT CGG AAA TAC AAG GAC ' TTC^ ^TGC; ATC GGA 144 

Lys Arg Asp Pro Cys Leu Arg Lys Tyr Lys "ASp ^Phe Cys lie Gly 
35 40 '45 '-''•^' -•^ ■ 

GAA TGC AAA TAT GTG AAG GAG CTC CGG GCT'''CCC TCC TGC ^ATC TGC 'CAC ' " 192 

Glu Cys Lys Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys lie Cys His 
50 55 60 

CCG GGT TAT CAT GGA GAG AGG TGT CAT GGG CTG AGC CTC CCA GCC ATG 240 

Pro Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser Leu Pro Ala Met 

65 70 75 ' "* 80 

GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC ' GCG GGT' CAA TAC * 288 

Val Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr 
85 90 '95 



TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA -AAG -GAT CCA AAC CTG 336 

Ser Ser Phe Val Asp Lys lie Arg Asn Asn Val -Lys Asp Pro Asn* Leu 

100 105 110 

AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC ' CCA CCT TCT AAA' GAA * 384 

Lys Tyr Gly Gly Thr Asp lie Ala Val lie Gly Pro Pro Ser Lys Glu 

115 120 125 

AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT J*^-^ 

Lys Phe Leu Arg lie Asn Phe Gin Ser Ser -Arg Gly Thr Val Ser Leu ' 
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130 135 



140 



QGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA AT6 GAT 
Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asd 

ISO 

AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC 
Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu lie Thr Ser 

165 170 3^75 

GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA 
Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lvs 
180 185 190 



AAC AAG TTC GAC. TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG 
Asn Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe Glu Val Ser Trp 
275 280 285 



AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC 
Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp 

310 315 320 

TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG TAG 
IjBu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
325 330 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 base pairs 

(B) .TYPE: nucleic acid 



480 



526 



576 



GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG 624 
Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin 
195 200 205 



ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC 
He Thr Gin Gly Asp Lys Ser Arg Lys. Glu Leu Gly Leu Gly lie Asd 
210 215 220 

TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT 
Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val 
2" 230 235 240 

AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG 
Lys Asn Glu Ala Arg Phe Leu Leu He Ala He Gin Met Thr Ala Glu 
245 250 255 

GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG 6TA ACT AAG AAC TTC CCC 816 
Val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn Phe Pro 
260 265 270 



672 



720 



768 



864 



CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT 912 
Arg Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe 
290 295 360 



960 



1002 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA \ 

- . ' ' ^ 

(ix) FEATURE: 

(A). NAME/KEY: CDS;' .'• i..-.' .... : ' 

' (B) LQCATJON: 1... .1230 • • r -' - ' ' " - ' ' 

(ix) FEATURE: 

(A) NAME/KEY: matjjeptide - ■ / ^ 

(B) LOCATION: ,-1.. .46.5 ■ - V . . . - ' 
(D) OTHER.. INFORMATION: /product* "bFGP" 

(ix) FEATURE.: ~" ■ - 

(A) NAME/KEY: matj>ept±de 3 ■ • - 

(B) LOCATION: 4572.. 1230 

(D) OTHER INFORMATION: /product^ "Saporin** 
(xi) SEQipiilCEr DESpRlCT : ^ ' . ' ■ ^ ' 

ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC - 48 

Met Ala Ala Gly Ser lie Thr Thr Leu Piro Ala Leu 3Pr6 <31u A^p 'Gl>r ' ; 

^ r :5 p- , - \:. - 10 ' ' '• ■ / ' ■* 15 ' " ' ' * 

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG .CTG , 96 

Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu ' ' ■ ' 

20, . :^ 2s -r.r. i-r.x u-j :r-j. i^^ :. ^ 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CCJA. . 144 » 

Tyr Cys Lys Asn Gl:X>Gl:y. Phe. Phe. Leu Arg f l^e- His Pro As^" ^ly ^g ' V ' 7 

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT, .192 
Val Asp Gly Val Arg, Glu. I^ys. Ser. Asp. Prb His Il¥ Lys Leii cilh Leu 

'50 ; — • 60-' ' ' *.;. "' 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 2.4 0. 

Gin Ala Glu Glu . Arg Gly VaX Val . Ser ^ lie Lys Giy' Val Cys Ala A^h ^ * 

€5 70 : rr: - 75 - ■ ' 80"' " ' 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT _ 288. 
Arg Tyr Leu Ala Met ;Lys :Glu :Asp Gly Arg -Leu Leu Ala Ser Lys Cys' " ; ! 1 
- . 85 J . -V 90-" '-^^ ' ■ " S^5 ' ^ 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 
Val Thr Asp Glu Cys Phe,, Phe Phe Glu Arg v Leu Glu Ser Asn Asn Tyr"" • ' 
100 - . .105 ' ■ * 110 ' - 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys ' 
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG' AAA \ 432 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
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130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GTC ACA TCA 4 80 

Ala lie Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Val Thr Ser 
145 150 155 160 

ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT 528 
lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe 
1€5 170 175 

GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT 576 
Val Asp Lys lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly 
180 185 . 190 

GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT 624 
Gly Thr Asp He Ala Val He Gly Pro Pro Ser Lys Glu Lys Phe Leu 
125 200 205 

AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA 672 
Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys 
210 215 220 

CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT 720 
Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn 
225 230 235 240 

GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA 768 
Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He Thr Ser Ala Glu Leu 
245 250 255 

ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA 816 
Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu 
260 265 270 

TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG 864 
Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn Ala Gin He Thr Gin 
275 280 , 285 

GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT" TTG 912 
Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly He Asp Leu Leu Leu 
290 295 300 

ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA 960 
Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys ASn Glu 
305 310 315 320 

GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA 1008 
Ala Arg Phe Leu Leu He Ala He Gin Met Thr Ala . Glu Val Ala Arg 
325 330 335 

TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC ' 1056 
Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn Phe Pro Asn" Lys Phe ' ' 

340 345 350 
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GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT 
Asp Ser Asp Asn Lys Val He Gin Phe Glu Val Ser Trp Arg Lys He 

355;. : 360 - 365/ ; . . i ' 

TCT ACG GCA>TA TAC GGG GAT GCC AAA AAC GGC GTG TTt'aAT AAA GAT 
Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp 

,;:,37o . .. . -375 y- ^ 380' ' ::: ;' - • : : 

TAT GAT TTc'gGG;:TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG 
Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val Lys Asp Leu Gin Met 

GGA CTC CTT ATG TAT . TTG GGC AAA CCA AAG ' ' 
Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
405 410 

(2) INFORMATION FOR SEQ ID NO:35: ' ' ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:- 1230 base pairs - ' ' ' ; 

(B) . TYPE: nucleic acid-' \ ' ' " • ' 
<C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 



> .(ix) FEATURE: ; . - . ' '"rv / O'. • • 

(A) ...NAME/KEY:- CDS if : ' ' 

(B) LOCATION: 1..123 0 • ' • • 

. :(ix) FEATURE:-.--. ^ r. . - 'V'*- ' -O* 

(A) . NAME/KEY: mat^peptide ^ ' ' ' '^ ' 

(B) LOCATION: 1..465 ^ 

(D) OTHER INFORMATION: /product= "bFGF" 

<ix) FEATURE:- • . ' . ' ^- '.I "\ ■ 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 472.. 1230 

(D) OTHER INFORMATION: '/product* "Saporin" 
(xi) SEQUENCE DESCRIPTION r-^ SEQ ID NO:35: 

ATG GCT GCT GGT TCT, ATCkACT-ACT- CTG CCG-GCT'cTG CC<5 ' GAA GAC GGt 
Met Ala Ala Gly Ser lie Thr .Thr Leu Pro Ala- Leii Prd Glu Asp Gly 
1 5 10 15 

GGT TCT GGT GCT .TTC CCG-'.CCC GGC^ CAC TTC AAG- GAC CCC AAG CGG CTg' 
Gly Ser Gly Ala Phe .Pro Pro rely His Phe Lys Asp Pro Lys Arg ^Leu 
20 25 30 

TAC TGC AAA AAC GGG, GGC TTC TTC *CTG CGC- ATC CAC CCC GAC bcC 'CGA 
Tyr Cys Lys Asn Gly ,Gly. Pher Phe Leu Arg lie "His Pro Asp Gly Arg 
35 40 - 45 



1104 



1152 



1200 



1230 



* 1 ■ 



,..48 



, .96 



144 
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GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192 
Val A^p Gly Val Arg Glu Lys Ser Asp Pro His lie Lys Leu Gin Leu 
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 24 0 

Gin Ala Glu Glu Arg Gly Val Val Ser lie Lys Gly Val Cys Ala Asn 
65 7.0 75 * 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Lev Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 
85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT .AAC TAC 336 
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 
100 105 110 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr T-yr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys 
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gin Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gin Lys 
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GTC ACA TCA 480 
Ala lie Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Val Thr Ser 
145 150 155 160 

ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT 528 
lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin Tyr Ser Ser Phe 
165 170 175 

GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT 576 
Val Asp Lys lie Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly 
180 185 190 

GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT 624 
Gly Thr Asp lie Ala Val lie Gly Pro Pro Ser Lys Glu Lys Phe Leu 
195 200 205 

AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA 672 
Arg lie Asn Phe Gin Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys 
210 215 220 

CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT 720 
Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn 
225 230 235 240 

GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA 768 
Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu lie Thr Ser Ala Glu Leu 
245 250 255 

ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA' 816 
Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin Lys Ala Leu Glu 
260 265 270 
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TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG 
Tyr Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala ^ Gin 
275 280 285 



ATA ACA CAG 
lie Thr Gin 



864 



GGA GAT AAA. AGjT, AGA AAA^^GAAj CTp G <SAC 
Gly Asp Lys^, Ser Arg Lys Glu Leu Gly Leu Gly lie Asp 
290 295 300 



TTA CTT TTG 
Leu Leu Leu 



912 



ACG TTC ATG GAA,.GCA ,GTG ■ AAC ,AAG. AAG GCA 'CGT GTG GTT 
Thr Phe Met Glu . Ala Val Asn Lys Lys Ala Arg Val Val 
305 310 315 



AAA- AAC GAA^ 
Lys Asn Glu 
320 



960 



GCT AGG TTT CTG ,CTT; ATC. GCT. 'ATT. CAA ACA iGCT ' GAG 

Ala Arg Phe Leu Leu Xle Ala lie Gin Met Thr Ala Glu 
325 330 



GTA GCA CGA 
Val Ala Arg 
335 



1008 



Phe 



AGG TAC 
Arg Tyr 



ATT .CAA 
lie Gin 
340 



AAC -TTG 
Asn Leu 



GTA; -ACT 
Val Thr 
345 



AAG.;AAC .TTC CCC- 
Lys Asn Phe Pro 



•AAC AAG 1 X C 
Asn Lys Phe 
350 



1056 



GAC TCG GAT AAC AAG GTG- ATT CAA.„TTT. GAA GTC AGC TGG 
Asp Ser Asp Asn Lys Val lie; Gin Phe Glu Val Ser Trp 
355 360 365 



CGT AAG- ATT 
Arg Lys lie 



1104 



TCT ACG GCA ATA TAC .GGG . GAT GCC AAA:^ AAC GGC GTG TTT 
Ser Thr Ala, I.le Tyr Gly Asp Ala Lys Asn Gly Val Phe 
370 375 380 



AAT AAA GAT 
Asn Lys Asp 



11S2 



TAT GAT TTC. GGG„ .TTT^ GGA AAA. GTG AGG,. CAG i GTG' AAG''^ GAC- 
Tyr Asp Phe Gly -Phe Gly Lys Val Arg' Gin Val Lys Asp 
385 390 395 



TTG'^'CAA-^'ATG 
Leu^ Gin Met 
400 



1200 



GGA CTC CTT . ATG TAT -.TTG .GGQ AAA- CCA AAG ; 
Gly Leu Leu Met iyr- Leu Gly Lys Pro Lys * 
405 410 



1230 



(2) INFORMATION , FOR SEQ ID NO:.36: 

(i) SEQUENCE CHARACTERISTICS: 
, ^ (A) LENGTH: : 768 base pairs. 

(B) TYPE: . nucleic acid 

(C) STRANDEDNESS : .double 

(D) TOPOLOGY: unkno%*n 

(ii) MOLECULE TYPE :. cDNA ^ - 



(ix) FEATURE: 

(A) NAME/KEY: -CDS ....... 

(BK/LOCATION: .4..7.68; 

(D) OTHER INFORMATION: /product = "SAP CYS 



+4" . 



(ix) FEATURE: - ^ , , r 

(A) - NAME/KEY,: CDS : i 

(B) LOCATION: 7 . . 768 
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(D) OTHER INFORMATION: /product = "mature SAP CYS +4 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CAT ATG GTC ACA TCA TGT ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT 48 
Met Val Thr Ser Cys Thr Leu Asp Leu Val Asn Pro Thr Ala Gly 
1 5 10 15 

CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA 96 
Gin Tyr Ser Ser Phe Val Asp Lys He Arg Asn Asn Val Lys Asp Pro 
20 25 30 

AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT 144 
Asn Leu Lys Tyr Gly Gly Thr Asp He Ala Val He Gly Pro Pro Ser 
35 40 45 

AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC 152 
Lys Glu Lys Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val 
50 55 60 

TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA 240 
Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala 
65 70 75 

ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT 288 
Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He 
80 85 90 95 

ACT TCC GCC GAG TTA^ ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT 336 
Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn 
100 105 110 

CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT 384 
Gin Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn 
115 120 125 

GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG 432 
Al^ Gin He Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly 
130 135 140 . • 

ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT 480 
He Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg ' 
145 150 155 

GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA 528 
Val Val Lys Asn Glu Ala Arg Phe Leu Leu He Ala He Gin Met Thr 
1€0 165 . 170 175 

GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC 576 
Ala Glu Val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn 
180 185 190 

TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC 624 
Phe. Pro Asn Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe Glu Val 
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195 200* ■ - 205 

AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC 672 
Ser Trp Arg Lys lie Ser Thr Ala lie Tyr Gly Asp Ala Lys Asn Gly 
210 215 220 

GTG TTT AAT; AAA .GAT- TAT:. GAT . ^TTG' GGG- TTT GGA AAA" GTG AGG CAG GTG 720 
Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val 
225 230 235 

AAG GAC TTG CAA. ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG TAG 76 B 

Lys Asp Leu Gin Met Gly Leu Leu Met -Tyr Leu Gly Lys Pro Lys 
240 245 250 .255 

(2) INFORMATION FOR SEQ ID NO: 37: 



(A) . LENGTH:" 768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) , TOPOLOGY : > both • ~ 



0^ 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE:- • - ■ " ■ * - ^ - * - ' ' ^' ' * "I \ * 

(A) j NAME /KEY:-: CDS^'- -\ - • * ' ^' ''"'^ " ""'^ \[ 

(B) LOCATION: 4 . . 768 ' 

(D) OTHER INFORMATION: /product* "SAP CYS -►lO" 
, r.:. • • ' i- ■ ^-'^ '-^^'^ ' "' 

(ix) FEATURE:- ' ^ ^ • ^'^^ ' "'^ ' ' * " 

(A) NAME/KEY: CDS 

(B) LOCATION: 7 . . 768 

(D) OTHER : INFORMATION: /products '•'mature SAP CYS +10" ' ' ? 

7"*' '•'^ - • ^ y \ ' '''' ''~ ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CAT ATG GTC ACA. TCA ATC .ACA TTA? GAT-CTA GTA TGT CCG ACC GCG GGT' ' * 48 
Met Val Thr Ser He -Thr Leu Asp Leu Val Cys Pro Thr Ala Gly 

1 5 10 ; , . 15 . I . 

CAA TAC TCA TCT TTT^GTG GAT AAA ATC- CGA AAC AAC GTA AAG GAT' CCA" ' * '* " 96^ 
Gin Tyr Ser Ser Phe Val Asp 'Lys He Arg Asn Asn Val Lys Asp Pro 

20 25 30 . . 

AAC CTG AAA TAC GGT . GGT ACC GAC ATA- GCC GTG ATA GGC ' CCA ' CCT TCT ' '144 

Asn Leu Lys .Tyr Gly Gly Thr Asp iTe Ala Val He Gly Pro Pro Ser 
35 40 .45 

AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ^VCG GTC ' " 192 
Lys Glu Lys Phe Leu Arg He Asn Phe Gin Ser Ser Arg Gly Thr Val 
50 55 60 

/;. ^ .r; : • - * ' .* ~ • ■ 

TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCiS TAT 'CTT GCA ' 240 
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Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala 
65 70 75 



ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT 
Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu He 
80 85 90 95 

ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT 
Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn 
100 105 iio 

CPX3 AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT 
Gin Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys Asn 

120 125 

GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG 
Als Gin He Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly 
130 135 140 

ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT 
He Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg 
145 150 155 

GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA 
Val Val Lys Asn Glu Ala Arg Phe Leu Leu He Ala He Gin Met Thr 
160 165 170 175 

GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC 
Ala Glu Val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn 
180 185 190 

TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC 
Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe Glu Val 
195 200 205 



GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG 
Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val 
225 230 235 



288 



336 



384 



432 



480 



528 



576 



624 



AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC. AAA AAC GGC 672 
Ser Trp Arg Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn Glv 
210 215 220 



720 



AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG TAG 768 
Lys Asp Leu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
240 245 250 255 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single : . .. ' 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: ' . . - 

(A) NAME/KEY: CDS .. » : A ' ^- • 

<B) LOCATION: 3. .35 '.' 
(A) NAME/KEY: Cathepsin B linker 

(xi) SEQUENCE DESClClPTlbN: SEQ m - * 

CCATGGCCCT GGCCCTGGCC CTGGCCCTGG CCATGG 36 
(2) INFORMATION FOR SEQ ID NO:39: , ' . ' * 

( i ) SEQUENCE ' CHAkACTERI STICS : 

(A) LENGTH:' 51 base pairs 

(B) TYPE: nucleic . acid ^ ^ * • * . ^ r' v. - ' • - 

(C) STRANpEDNESS: .single ^ • i , • - ' • 

(D) TOPOLQGY: .linear 

(ii) MOLECULE ..TYPE,: DNA .(genomic), • ' "*•' '* : 

^ ■' ' " . ^ . . , > . .. . , - ' '''.I '- ' ■ 

(ix) FEATURE:' " *^ , V 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 50 : 'Vj. : V'. ' ' ' " ' " 

" ■ (A) NAI^/KEY: Cathepsin D linker' ^ • >' ' ' ' /| 

(xi) SEQtfeNCE DESCRIPTION: SEQ ID NO: 39 

. -".^ --/^ • -■^•^'^ J-'; " :; 

CCATGGGCCG ATCGGGCTTC CTGGGCTTCG GGTTGCTGGG GTtCGGCA ' ' 51 *' 

(2) INFORMATION FOR SEQ ID NO: 40: 

^''(i) sequence; ,CHARACTERISTiicS:\ ,.vJ - • ' -^^-^ 

(A) LENGTH: 27 base pairs ^ . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: . single _t • ■ / ^- * '* ' " : * 
" - (D) TOPOLOGY: ; linear. . . , • ' 

(ii) MOLECULE TYPE: DNA ' (genomic) 

(ix) FEATURE:';"]' / . . . r- '■ • -I' ' ' • 

(A) NAME/REY:" CDS ' 

(B) LOCATION: 3 . .26 * 

(A) NAME/KEY :..GlY4Ser with Ncol ^ ends *- ' • • ^' • ; ' " ' 

(xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 4 0 
CCATGGGCGG CGGCGGCTCT GCCATGG - " ./ ' ^- . ' ' 27 

(2) INFORMATION FOR SEQ ID NO:41: • • ' ■■ 

(i) SEQUENCE CHARACTERISTICS: ' ' ' ' ' - 

(A) LENGTH: 42 base pairs v.: ' 

(B) TYPE: nucleic acid .. . r 
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(C) STRANDEDNESS : single 
iD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: ' 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 41 

(A) NAME/KEY: (Gly4Ser) 2 with Kcol ends 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG 42 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LEJJGTII: 7S base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 • . 74 ' 
(A) NAME/KEY: (Ser4Gly)4 with Ncol ends 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42 

CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCTC GTCGTCGTCG GGCTCGTCGT 60 

CGTCGGGCGC CATGG 75 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 45 

(A) NAME/KEY: (Ser4Gly)2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
CCyVTGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG 45 
(2) INFORMATION FOR SEQ ID NO: 44 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLCX3Y: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS ' - ' • - 

(B) LOCATION: 3 . . 95 

(A) NAME/KEY: "Trypsin linker ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 • - . 
CCATGGGCCG ATCGGGCGGT GGGTGCGCTG GTAATAGAGt GAGAAiSATCA GTCGGAAGCA' 60 

GCCTGTCTTG CGGTGGTCTC GACCTGCAGG CCATGG - ■ . 9^ 

(2) INFORMATION FOR SEQ ID NO:45: * - 

(i) SEQUENCE CHARACTERISTICS: ' ' 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown ' 



\ - - s . 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

' (A) NAME;/iCEY: CDS 'C, . '.j' . 

(B) LOCATION: 1..18 

(D) OTHER INFORMATION: /product » Thrombin substrate linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45- - 

CTG GTG CCG CGC GGC AGC ' . : 

Leu Val Pro Arg Gly Ser - 
1 5 

(2) INFORMATION FOR SEQ ID NO:46: - 

(i) SEQUENCE CHARACTERISTICS: - ' 

(A) LENGTH: IS base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double . . • . 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS . . - 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /product* Eriterokinase substrate liriJcer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

GAC GAC GAC GAC CCA 
Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B> LOCATION: 1 . . 12 

(D) OTHER INFORMATION: /products Factor Xa substrate 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

ATC GAA GGT CGT 
lie Glu Gly Arg 



(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : iinknown 

(ii) MOLECULE TYPE: peptide 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /product. Flexible linker > 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO:48 

Ala Ala Pro Ala Ala Ala Pro Ala 

1 5 . 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide - , . - . • 

(ix) FEATURE: ' - 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .4 

(D) OTHER INFORMATION: /product^, subtilisin substrate, linker 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 
Phe Ala His Tyr ^ ' ^ 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: arnino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: \inknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
<B) LOCATION: 1..4 

(D) OTHER INFORMATION: /product 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50^' ' ' ' ' 

Xaa Asp Glu Leu 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE -TYPE: DNA-lg^Bnomxc) * 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5l' 
CTGGCTGCAG TTCTCTCGGC A 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



subtilisin substrate linker 
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(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
TATATGCCAT 6GCCAGAGTC ACTTTATCCT CCAAG 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
TATATGTCGAC TATGGGAGGC TCAGCCCTITGA CA 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
TGAGCGAATT CCATATGGTC ACATCAATCA CATTA 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
TATATGAATT CCATGGCCTT TGGTTTGCCC AAATACAT 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CrHARACTERISTICS : 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRA3MDEDNESS : single 

(D) TOPOLOCSY: linear 
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(ii) MOLECUIjE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 
TATATGGATC CTATGTGTAG AGTCACTTTA TCCTCCAAG 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S7 

TATATAAGCT TCTATGGGAG GCTCAGCCCA TGACA 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 71 base pairs 

(B) TYPE: nucleic acid • " 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



39 



35 



(ix) FEATURE : • . : ; V: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4 . . 771 

(D) OTHER INFORMATION: /product = "SAP 



CYS -1" 



(xi) SEQUENCE DESC:RIPTI0N: SEQIDN0:58: - 

CAT ATG TGT GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG 
Met Cys Val Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala 
1 5 10 ' ' 15 

GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC' GTA AAA GAT 
Gly Gin Tyr Ser Ser Phe Val Asp Lys lie Arg Asn Asn Val Lys Asp 

20 . 25 - . " • * 30 

CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT 
Pro Asn Leu Lys Tyr Gly Gly Thr Asp lie Ala Val lie Gly Pro Pro 
35 40 \. 45 

TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG 
Ser Lys Glu Lys Phe Leu Arg lie Asn Phe Gin Ser Ser Arg Gly -Thr 
50 55 -60 ^ ^ . . 



48 



" • 96 



144 



192 
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GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT 24 0 

Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu 
65 70 75 

GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA 288 
Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu 
80 85 90 95 

ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA 336 
He Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala 
100 105 110 

AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG 384 
Asn Gin Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser He Glu Lys 
115 120 125 

AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG 432 
Asn Ala Gin He Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu 
130 135 140 

GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA 480 
Gly He Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala 
145 150 155 

CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG 528 
Arg Val Val Lys Asn Glu Ala Pixg Phe Leu Leu He Ala He Gin Met 
160 165 170 . * 175 

ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG 576 
Thr Ala Glu Val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr Lys 
180 185 190 

AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA 624 
Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe Glu 
195 200 205 

GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC 672 
Val Ser Trp Arg Lys He Ser Thr Ala He Tyr Gly Asp Ala Lys Asn 
210 215 220 

GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG 720 
Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin 
225 230 235 

GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 768 
Val Lys Asp Leu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 
240 245 250 255 

TAG 771 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
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(B) TYPE: nucleic acid 

-(C) STRANDEDNESS : single ' • 

(D), TQPOLOGY: linear •; ■ . - / ' 

(ii) MOLECULE TYPE: cDNA 

(xi) S^Qtip?CE. DESCRXPTrON: SEQ :IDND:59: " ' >' ^ ' 

CATATGGTCA CATCATGTAC ATTAGATCTA GTAAAT 36 
(2) INFORMATION, FOR. SEQ ID NO : 60: . . : i ' • . t ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: . 5jp base pairsi.' ^- ..... : , - - * 

(B) TYPE: nucleic acid;; . i 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE- TYPE ::,cpNAv J - =- . ■ • .* : ' • ' ' ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CATATGGTCA CATCAATC AC .ATTAGATCTA GTATGTCCGA CCGCGGGTCA ' 'SO 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE . (CHARACTERISTICS :,. .1 - :* • ^^ '^ ^ . ; . i ^ 

(A) ^LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

, (C)^,.STRANDEDNESS: .single ' T.' r-j};. V D i,: ' ^ . 'u : ^ y^'. 

(D),TQPOLOGY: linear - .' \ 'v' > ^ • ' ' ' ' 

• ^ • ^ J ' * ■■* 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQlJEaJCE DESCRIPTION;: SEQ: ID NO :~61 : *-^ : " - - 

TTTCAGGTTT GGATCTTTTA CGTTGTTT 28 

(2) INFORMATION FOR SEQ ^IDi NO.: 62 : ' ^1 ' * 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: ^2 8/. base, pairs • ' ' '■ '"'^ ' 

(B) TYPE: nucleic acid - ' *. • . ^ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLEtrULE TYPE: cDNA . . . * 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
AAACAACGTA AAAGATCCAA ACCTGAAA 28 
(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGPTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: \inknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product- nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 

Ala Pro Arg Arg Arg Lys Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknovm 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Lys Arg Lys Lys Lys 
I S 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .5 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 

lie Arg Val Arg Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 66: 
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(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: € amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1.'.6- ' 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

SEQUENCE DESCRIPTION: SEQ ID NO: 66 

Lys Arg Lys Arg Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: i • 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 
(D) OTHER INFORMATION: /products nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67" ^ ' * 

Pro Lys Lys Arg Lys Val Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: xinknown 

;.,(ii) MOLECULE TYPE:- peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..8 
(D) OTHER INFORMATION: /product = nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68 ' ' ' 




(ix) 



(xi) 
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Pro Pro Lys Lys Ala Arg Glu Val 
1 5 



(2) INFORMATION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: iinknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

<A) NAME/KEY: CDS 
(B) LOCATION: 1 . . 9 

(D) OTHER INFORMATION: /product- nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 

Pro Ala Ala Lys Arg Val Lys Leu Asp 
15 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single -* 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /product* nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 

Lys Arg Pro Arg Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 1..5 " * 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 ' " " 

Lys lie Pro lie Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE. TYPE: peptide 

(ix) FEATURE: -~' 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 9 
(D) OTHER INFORMATION: /product* nuclear ' translocation sequence 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 72 

Gly Lys Arg Lys Arg Lys Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: unknown 

(ii.) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 9 

(D) OTHER INFORMATION: /product* nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73 '''''' 
Ser Lys Arg Val Ala Lys Arg Lys leu - ' ^ . » 

(2) INFORMATION FOR SEQ ID NO:74: ' - * . * ' 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product* nuclear translocation sequence 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 74 

Ser His Trp Lys Gin Lys Arg Lys Phe 

1 5 . 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /product- nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 

Pro Leu Leu Lys Lys lie Lys Gin 
15 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product* nuclear treuis location sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 

Pro Gin Pro Lys Lys Lys Pro 
15 

(2) INFORMATION FOR SEQ ID NO: 77: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLCX3Y: unknown 



( ii ) MOLECULE .-.TYPE : peptide ^ ' ' 

(ix) FEATURE: • ' • ' ' ■ ■ ^. 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 15 ; 7 

(D) OTHER INFORMATION: /product = nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 ' ' " * 

Pro Gly Lys Arg Lys Lys Glu Met Thr Lys Glh Lys Glu Val Pro 

1 5 • 10 ■ ' 15 

(2) INFORMATION FOR SEQ ID NO: 78: 'T. : 

(i) SEQUENCE CHARACTERISTICS : ' ' ' ''^ ' " - - - 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid .* ' * 

(C) STRANDEDNESS : single ' * . / ' 
(D> TOPOLOGY : unicnown ' ■ 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS ' ' ^' ' 1' " 

(B) LOCATION: 1 . . 12 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 

Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Ala Pro^ ' ' 

1 5 10 ' \ ' ' 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: xinknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .7 
(D) OTHER INFORMATION: /product = nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 
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Asn Tyr Lys Lys Pro Lys Leu 
1 5 

(2) INFORMATION. FOR SEQ ID NO: 80: . . 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /product^ nuclear translocation sequence 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 

His Phe Lys Asp Pro Lys Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

. . (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10.. 783 

(D) OTHER INFORMATION: /product^ "Airplified SAP with EcoRl 
ends" 

(xi) SEQUpJCE DESCRIPTION: SEQ ID NO: 81: 

GAATTCCAT ATG GTC ACA TCA ATC ACA TTA GAT CTTA GTA AAT CCG ACC 48 
Met Val Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr 
1 5 10 

GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG 96 
Ala Gly Gin Tyr Ser Ser Phe Val Asp Lys lie Arg Asn Asn Val Lys 
15 20 .25 

GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA 144 
Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp lie Ala Val lie Gly Pro 
30 35 40 45 
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CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA 192 
Pro Ser Lys Glu Lys Phe Leu Arg lie Asn Phe Gin Ser Ser Arg Gly 
50 55 60 

ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT 240 
Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val 7U.a Tyr 

65 70 :'y- -75 ■ ^' ' - ■ 

CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA'TAT TAC TTC AAA TCA 288 
Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr -Tyr Phe Lys Ser 
80 85 - = • ' 90 

GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT " ' 336 
Glu lie Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr 
95 100 105 

GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG tCG ATC' GAA 384 
Ala Asn Gin Lys Ala Leu Glu; Tyr Thr Glu . Asp - Tyr bin" Ser' He Glu 
lib 115 120 125 

AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG 4 32 

Lys Asn Ala Gin He Thr Gin Gly Asp Lys Ser Arg^'Lys ■ Glu Leii' Gly 
130 135 ^ 140 

TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GGA GTG ^ '/Ijic ' AAG^ 'AAG ' 480 

Leu Gly lie Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys 

145 150 VV/ '1^.5^^: ' 

GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG -CTT^-ATC 'GCT Aft C^ 528* 
Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu "lieu He Ala^ II^ Gin | 
160 165 170 -^ '"^ ■ ' 

ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT ' 576 
Met Thr Ala Glu Val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr 
175 180 185 

AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC HAG GTG ATT CAA TTT' 624 
Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val tie Gin Phe 

190 , .195 .: r~ . . ' • 200 - * . " 205 

GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA 672 
Glu Val Ser Trp Arg Lys He Ser Thr 'Ala He Tyr Gly Asp Ala- Lys '* 
210 215 220 

AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG '* 720 
Asn Gly Val Phe Asn Lys Asp: Tyr Asp Phe Gly Phe Gly Lys Val Arg 
225 230 235 

CAG GTG AAG GAC TTG GAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA * 768 
Gin Val Lys Asp Leu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro 
240 245 250 

AAG GCC ATG GAA TTC : " * :* . ' ' ' " ' ' '■ ■ ' * 783 ' 

Lys Ala Met Glu Phe 
255 
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(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 
{Cy STRANDEDNESS : single 
(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1005 

(D) OTHER INFORMATION: /product = "SAP-HBEGF" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA 48 
Met Val Thr Ser lie Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gin 
1 5 10 15 

TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC 96 
Tyr Ser Ser Phe Val Asp Lys lie Arg Asn Asn Val Lys Asp Pro Asn 
20 25 so 

CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA 144 
Leu Lys Tyr Gly Gly Thr Asp lie Ala Val lie Gly Pro Pro Ser Lys 
35 ' 40 45 

GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA 192 
Glu Lys Phe Leu Arg lie Asn Phe Gin Ser Ser Arg Gly Thr Val Ser 
50 55 . 60 

CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG 240 
Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met 
€5 70 75 80 

GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT 288 
Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu lie Thr 
85 .90 .95 

TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG 336 
Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gin 
100 105 110 

AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC 384 
Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser lie Glu Lys Asn Ala 
115 120 125 

CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC 432 
Gin lie Thr Gin Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly lie 
130 135 140 
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GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG "aAG GCA" CGT GTC 
ASP Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys .Ala 2g 

- ..^"^Z ^ " . i«9 

GTT AAA AAC GAA GOT AGG TTT CTG CTT ATC GCT "ATT CAA ATG Ar-j. 
val Lys Asn Glu Ala Arg Phe Leu Leu lie SS "l" gJ^ SI? JS.Sa 

Glu val Ala Arg Phe Arg Tyr He Gin Asn Leu Val Thr Lys Asn 
180 185 190 

CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC A-- 
Pro Asn Lys Phe Asp Ser Asp Asn Lys Val He Gin Phe Glu SI? Se^ 

195 :. . ■ -^200 • ■•• • - ias •• 

TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GG" 

Trp Arg Lys He Ser Thr Ala IleTyr Gly Asp aL^lJ^ Asn ^ij Va?^ 



220 



TTT AAT AAA GAT TAT .GAT TTC GGG- TIT GGA AAA GTG 'AGG CAb (^TG Xag ^ 
Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gin Val 

"° . , 235 . . ^ , , , ^. 240 , 

^P^IZIX. mI^ 2f <^ ^ TAT TTG GGC AAA C^ ^ ^C^ 1^ " 

Asp Leu Gin Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys Ala M4t 

^^.^ ., 250 _ ^ . . 25.5, ,,,,, 

GCC'aGA GTC apt: TTA TCC TCC AAG CCA CAA GCIA CTG Gc6 AtA CdA ;^C ' 
Ala Arg Val Thr Leu Ser Ser Lys Pro Gin Ala Leu Ala Thr 

,,. ^ _ 265 27.0 ... 

AAG GAG GAG CAC GGG AAA AGa' ^G AAG AAA 6GC AAG GGd CItA GGG' aAg' '" 
I^ys Glu Glu His Gly Lys Arg Lys Lys Lys Gly Lys Gly Su ^ 

280 285 

AAG AGG GAC CCA TGT.. CTT CGG AAA" TAC AAG GAC TTC Tbc ATC CAC G^A ' 
I-ys Arg ASP Pro Cys Leu Arg Lys Tyr Lys Asp Phe C^s ill Ss Sy 

295 300 . _ 

GAA TGC AAA TAT GTG ^ AAG. GAG. CTG CGG GCT CCC TCC^TGC-ATC" TGC CAC ' " " 
Glu Cys l.ys Tyr Vai Jys Glu I^eu Arg Ala Pro Ser Cys He Cys His 

CCG GGT TAT CAT- GGA..GAaiAGG . TGT • CAT GGG CTG AGc'cTC CCA TA 
Pro Gly Tyr His Gly Glu Arg Cys His Gly Leu ^ Ser Leu Pro 

330 335 



460 



528 



576 



624 



720 



768 



816 



864 



912 



960 



1005 



i 2 ) INFORMATION. FOR . SEQ ,;ID. .NO : 83 : - 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LEKGTH: 240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..240 

(D) OTHER INFORMATION: /product- "MET-CYS-HBEGF" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

ATG TGT AGA GTC ACT TTA TCC TCC AAG CCA CAA GCA CTG GCC ACA CCA 48 
Met Cys Arg Val Thr Leu Ser Ger Lys Pro Gin Ala Leu Ala Thr Pro 
1 5 10 15 

AAC AAG GAG GAG CAC GGG AAA AGA AAG AAG AAA GGC AAG GGG CTA GGG 96 
Asn Lys Glu Glu His Gly Lys Arg Lys Lys Lys Gly Lys Gly Leu Gly 
20 25 30 

AAG AAG AGG GAC CCA TGT CTT CGG AAA TAC AAG GAC TTC TGC ATC CAC 144 
Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr Lys Asp Phe Cys lie His 
35 40 45 

GGA GAA TGC AAA TAT GTG AAG GAG CTC CGG GCT CCC TCC TGC ATC TGC 192 
Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys lie Cys 
50 55 60 

CAC CCG GGT TAT CAT GGA GAG AGG TGT CAT GGG CTG AGC CTC CCA TAG 240 
His Pro Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser Leu Pro 
65 70 75 80 

(2) INFORMATION FOR SEQ ID NO : 84 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: xmknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..24 9 

(D) OTHER INFORMATION: /product* 

-MET-CYS - ALA-MET-ALA-HBEGF " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 



ATG TGT GCC ATG GCC AGA GTC ACT TTA TCC TCC AAG CCA CAA GCA CTG 
Met Cys Ala Met Ala Arg Val Thr Leu Ser Ser Lys Prd Gin Ala Leu 
^ 5 10 



15 



GCC ACA CCA AAC AAG GAG GAG CAC GGG AAA AGA AAG AAG AAA GGC AAG 
Ala Thr Pro Asn Lys Glu Glu His Gly Lys Arg Lys Lys Lys Gly hys 
20 25 30 

GGG CTA GGG AAG AAG AGG GAC CCA TGT CTT CGG AAA TAC AAG GAC TTC 
Gly Leu Gly Lys Lys Arg Asp- Pro Cys Leu Arg Lys Tyr Lys Asp Phe 
35 40 45 

TGC ATC CAC GGA GAA TGC AAA TAT GTG AAG" GAG CTC CGG GCT CCc' TCC 
Cys He His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg Ala Pro Ser 

TGC ATC TGC CAC CCG GGT TAT CAT GGA- GAG AGG TGT CAT GGG CTG AGC 
Cys He Cys His Pro Gly Tyr His Gly Glu Arg Cys His Gly Leu Ser 

€5 ... . ... ^ , 7,0' ■ ■■' ' *• ' ■ ' ' 



CTC CCA TAG 
Leu Pro 



: 75.. - ■ . 3^ 



.M.J 



(2) INFORMATION FOR . SEQ. ^ID : NO : 85 : :i - 

(i) SEQUENCE CHARACTERISTICS: 

(A), LENGTH: 5 .amino acids - ' " 
(B h TYPE : % amino acid „ ' :.r. ... . ^ . ' ,. 

(C) STRANDEDNESS :? single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE peptide a - - '-ii ^? • " ^ ••^* - ' ' " - • 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Met Cys Ala Met Ala ' 
1 5 

(2) INFORMATION FOR SEQ ID NO:86: ; ^ " - * 

(i) SEQUENCE CHARACTERISTICS: - ' 

(A) LENGTH: 51 base pairs - ' 1 ^ • " 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

( ii ) MOLECULE TYPE : DNA (genomic ) . . - . 

(xi) SEQUENCE DESCRIPTION: SEQ ^ID NO: 86 : ' ' ' 
TATAGGATCC TGATGTGTGC CATGGCCAGA GTCACTTTAT CCTCCAAGCC A 



48 



96 



144 



192 



240 



249 



51 
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(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

lie Lys Arg Leu Arg Arg 

15 
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Claims 

1 . A conjugate, comprising a t^geted agent and a heparin-binding 
epidermal-like growth factor (HBEGF) polypeptide or a portion thereof, wherein the 
conjugate binds to a HBEGF receptor resulting in internalization of the linked targeted agent 
in cells bearing the receptor. 

2. A conjugate, comprising the following components: (HBEGF)n, (L)q 
and (targeted agent)ni, wherein: 

L is a linker; 

HBEGF is a HBEGF polypeptide; 

at least one HBEGF polypeptide is linked at any residue in the polypeptide via 
(L)q to at least one targeted agent; 

m and n, which are selected independently, are at least 1 ; 

q is 0 or more as long as the resulting conjugate binds to the targeted receptor, 
is internalized and delivers the targeted agent; and 

the conjugate binds to a receptor that interacts with and internalizes HBEGF, 
whereby the targeted agent(s) is internalized in a cell bearing the receptor. 

3. The conjugate of claim 2, wherein m and n, which are selected 
independently, are from 1 to 6. 

4. The conjugate of claim 2, wherein q is 1, n is 1 and m is 1 . 

5. The conjugate of claim 2, wherein L is selected from the group 
consisting of protease substrates, linkers that increase the flexibility of the conjugate, linkers 
that increase the solubility of the conjugate, linkers that increase the serum stability of the 
conjugate, photocleavable linkers and acid cleavable linkers. 

6. The conjugate of claim 5, wherein the linker is selected from the group 
consisting of cathepsin B substrate, cathepsin D substrate, trypsin substrate, thrombin 
substrate, recombinant subtilisin substrate, (GlyinSerp)n, (SerinGlyp)n and (AlaAlaProAla)n 
in which n is 1 to 6, m is 1 to 6 and p is 1 to 4. 
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7. The conjugate of claim 2, wherein L is selected from the group 
consisting of (Gly4Ser)q and (Ser4Gly)q, where q is 1 to 4. 

8. A conjugate of any one of claim 1 or 2, wherein a HBEGF polypeptide 
is selected from the group consisting of mammalian HBEGF polypeptides and HBEGF 
polypeptides in which a cysteine residue is added or replaces a non-essential amino acid 
residue within about 20 amino acids of the N-terminus or C-terminus of the polypeptide. 

9. The conjugate of any one of claim 1 or 2, wherein the targeted agent is 
a cytotoxic agent. 

1 0. The conjugate of any one of claim 1 or 2, wherein the targeted agent is 
a ribosome-inactivating protein. 

11. The conjugate of claim 1 0, wherein the targeted agent is a saporin. 

1 2. The conjugate of any one of claim 1 or 2, wherein the targeted agent is 

a nucleic acid. 

13. The conjugate of claim 2, wherein the conjugate that is a fusion protein 
selected from the group consisting of FPHl, FPHSl, FPHS2,FPHS3, FPttS4, FPHS5, FPSHl 
andFPSH2. ; ' 



14. A conjugate comprising a polypeptide of the formula: targeted agentn- 
(L)q-HBEGFni or HBEGFm-(L)q-targeted agentn, wherein: 
L is a linker; 

HBEGF is a HBEGF polypeptide; 

at least one HBEGF polypeptide is linked at any residue in the polypeptide via 
(L)q to at least one targeted agent; 

m and n, which are selected independently, are at least 1 ; 

q is 0 or more as long as the resulting conjugate binds to the targeted receptor, 
is internalized and delivers the targeted agent; and 

the conjugate binds to a receptor that interacts with and internalizes HBEGF, 
whereby the targeted agent(s) is internalized in a cell bearing the receptor. 
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1 ^. The conjugate of claim 1 4, wherein each HBEGF polypeptide in the 
conjugate is independently selected from the group consisting of HBEGF polypeptides and 
HBEGF polypeptides in which a cysteine residue is added or replaces a non-essential amino 
acid residue within about 20. amino acijciSjOfthe N-terminus of the polypeptide. 

16, TTie. conjugate of my one . of claims 1-15 for use asi an active 
therapeutic substance.. , . - - ^.,/ t -f n . 

, : conjugate of any one of claims 1-15 for use in the manufacture of 
a medicament for u-eating an HBEGF-mediated pathophysiological condition. 

1 8 . The conjugate of claim, 1 7, wherein the pathophysiological condition is 
a solid tumor in which the cells bear receptors to which HBEGF binds; a dermatological 
disorder involving epidermal cells, an ophthalmic disorder involving proliferation of 
epithelial cells, of a disorder characterized by proliferation of smooth muscle tells. 

19. The conjugate of any one 
proliferation of cells bearing HBEGF receptors. 

-J: ?9' , c^^^ effecting gene thetapy, wherein 

the cpnjugat^ inclu^^ a nuclearj translocatipn;s6querice(6peia^ the targeted ' 

nucleic acid or to a HBEGF. 



V . ■^P^'^ fragment comprising a sequence of nucleotides encoding the 
conjugate of any one of claims l-4.and 6-15. • *. ^ ' r.. 



A plasmid, comprising the DNA of claim' 21 . 



23. The plasmid of claim 22, wherein the plasmid is an expression vector 
for expression of the D>^ A encoding the conjugate in eukaryotic cells or is an expression 
vector for expression Qf ;|ie cpnju^ . 

24 . The exp;:ession vector of claim 23, comprising a DNA encoding a 
secretion signal seque^vc|?,operatiyely linked tO;th.e DNA encoding the conjugate. 
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25. The expression vector of claim 24, wherein the secretion signal is 
selected from the group consisting of OmpA, OmpT, phoA, bacterial alkaline phosphatase, 
pelB, the insulin leader sequence, mammalian alkaline phosphatase, growth horaione leader 
sequence and mellitin. 

26, The plasmid of claim 22 that is selected from the group consisting of 
P230B1, PZ31BI, PZ32B1, PZ33BK PZ34B1, PZ35B1, PZ36B1 and PZ37BL 



23. 



27. A cell transfected or transformed with the expression vector of claim 

28. The cell of claim 27 that is a bacterial cell. 

29. The cell of claim 27 that is an insect cell. 

30. A method of producing a HBEGF conjugate, comprising culturing the 
cells of claim 27 under conditions whereby DNA is transcribed and translated to produce the 
conjugate. 

31. A heparin-binding epidenmal growth factor-like growth factor 
(HBEGF) polypeptide that is modified by insertion of a cysteine residue or methionine- 
cysteine within or at about twenty amino acids of the N-terminus or C-terminus, wherein the 
inserted residue replaces a nonessential residue in an unmodified HBEGF polypeptide or is 
added to the HBEGF polypeptide. 

32. The modified polypeptide of claim 3 1 , wherein the cysteine residue is 
inserted within or at about 1 0 residues from the N-terminus. 

33. The modified polypeptide of claim 32 that is Met-Cys-HBEGF. 

34. A DNA fragment comprising a sequence of nucleotides encoding the 
modified polypeptide of claim 3 1 . 

35. A plasmid selected from the group consisting of PZ381, PZ39I, PZ40I 

andPZ41I. 
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36. TTie modified polypeptide ofclaim 31, selected from t^^ 
consisting pfFPSH2,FPSH3,FPSH4 and FPSH5; ' • . . 

37. A pharmaceutical composition comprising a conjugate according to 
any one of claims 1-4 or 6-16, in combination with a physiologically acceptable excipient. 

38. A method of producing ah HBEGF fiision protein, comprising: 

(a) culturing cells transformed with a plasmid containing a DNA fragment 
according to qlaim 21, under conditions whereby the DNA fragment is transcribed and 
translated; 

(b) lysing the transformed cells in a buffer containing urea to form a lysate 
containing an HBEGF fusion protein; . . y }, ' : i; ' . - . * 

(c) applying the lysate to a cation-exchange chromatography resin; 

(d) eluting the HBEQF, fusion protein from the cation-exchange 
chromatography resin of step (c); 

(^) passing. the HBjEGF fusion protein over ah anion-exchiange 
chromatography resin; : - - ; v i ( ; 

(f) applying the HBEGF fusion protein to an anion-exchange 
chromatography resin; 

^ I J. ; (g) « ^ ^i^V^S ^he HBEGF .ftisidn protein from ^the -cation-^ 
chroniatography resin of step (f); , , / : t'r:-> r :;ri • ^. I "i. 

Xh) applying the HBEGF :fusi6n protein to^a hydrophobic interaction ' 
chromatography resin; ^ t. ^ ■ . r ^ .i : :.:v. : :i. 

(i) eluting the HBEGF fiision protein from the hydrophobic interatction ' 
chromatography resin of step (h); and 

0) recovering the HBEGF fusion protein from a size exclusion 
chromatography resin. 
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